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PHARMACOPHORE RECOMBINATION FOR THE IDENTIFICATION OF 
1 0 SMALL MOLECULE DRUG LEAD COMPOUNDS 

FIELD OF THE INVENTION 

The present invention is directed to novel methods for identifying small 
molecule drug lead compounds. 

15 This invention was made with Government support under Contract No. R01 

GM50353 awarded by the National Institutes of Health. The Government has certain 
rights to this invention. 

BACKGROUND OF THE INVENTION 
In response to the ever increasing demand for novel compounds useful in the 

-0 effective treatment of various maladies, the medical research community has 
developed a number of different strategies for discovering and optimizing new 
therapeutic drugs. For the most part, these strategies are dependent upon molecular 
techniques that allow the identification of tightly binding ligands for a given biological 
target molecule. Once identified, these ligands may then carry out their therapeutic 

5 functions by activating, inhibiting or otherwise altering the activity of the molecular 
target to which they bind. 

In one such strategy, new therapeutic drugs are identified by screening 
combinatorial libraries of synthetic small molecule compounds, determining which 
compound(s) have the highest probability of providing an effective therapeutic and 

0 then optimizing the therapeutic properties of the identified small molecule 
compound(s) by synthesizing structurally related analogs and analyzing them for 
binding to the target molecule (Gallop et al., J. Med. Client. 37:1233-1251 (1994), 
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Gordon et al., J. Med. Chem. 37:1385-1401 (1994), Czarnik and Ellman, Acc. Chem. 
Res. 29:112-170 (1996), Thompson and Ellman, Chem. Rev. 96:555-600 (1996) and 
Balkenhohl et al., Angew. Chem. Int. Ed. 35:2288-2337 (1996)). However, this 
process is not only time consuming and costly, it often does not provide for the 
5 successful identification of a small molecule compound having sufficient therapeutic 
potency for the desired application. For example, while the preparation and evaluation 
of combinatorial libraries of small molecules has proven somewhat useful for new 
drug discovery, the identification of small molecules for difficult molecular targets 
(e.g., such as those useful for blocking or otherwise taking part in protein -protein 
10 interactions) has not been particularly effective (Brown, Molecular Diversity 2:217- 
222(1996)). 

One issue that limits the success of combinatorial library approaches is that it is 
possible to synthesize only a very small fraction of the possible number of small 
molecules. For example, greater than 10 60 different small molecules having valid 
15 chemical structures and molecular weights under 600 daltons can be envisioned. 
However, even the most ambitious of small molecule combinatorial library efforts 
have been able to generate libraries of only tens to hundreds of millions of different 
compounds for testing. Therefore, combinatorial technology allows one to test only a 
very small subset of the possible small molecules, thereby resulting in a high 

20 probability that the most potent small molecule compounds will be missed. Thus, 
suitable small molecule compounds having the required availability, activity or 
chemical and/or structural properties often cannot be found. Moreover, even when 
such small molecule compounds are available, optimization of those compounds to 
identify an effective therapeutic often requires the synthesis of an extremely large 

25 number of structural analogs and/or prior knowledge of the structure of the molecular 
target for that compound. Furthermore, screening large combinatorial libraries of 
potential binding compounds to identify a lead compound for optimization can be 
difficult and time-consuming because each and every member of the library must be 
tested. It is evident, therefore, that novel methods for rapidly and efficiently 

*° identifying new small molecule drug leads are needed. 

Living organisms evolve through a process that includes both (1) genetic 
recombination, where sexual reproduction acts to mix and recombine the attributes of 
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the parent organisms to provide progeny having attributes of both parents, and (2) 
natural selection, where only those progeny that are sufficiently "fit" are capable of 
passing their attributes on to the next generation. Approaches that closely model the 
process by which organism evolve have previously been reported for identifying small 
5 molecules that bind to receptors and enzymes (Weber et al., Angew. Chem. Int. Ed. 
Engl 34:2280-2282 (1995) and Singh et al., J. Am. Chem Soc. 1 18:1669-1676 (1996)). 
These approaches are based upon the mathematical method termed "genetic 
algorithms" (Holland, Sci. Am. 66-72 (1992)). Using genetic algorithms, a population 
of different compounds is screened to identify the compounds that bind to the receptor 
10 or enzyme (i.e., the "fittest" compounds). A population of progeny compounds is then 
prepared by recombining the building blocks that were used to prepare the "fittest" 
compounds. A screen is then performed to identify the compounds that bind to the 
target with the highest affinity, which are made up of the optimal building block 
combinations. 

15 However, because the building blocks employed in the genetic algorithm 

approach are not preselected, one of two techniques are used to identify tight binding 
ligands: (1) extremely large populations of compounds must be screened and 
recombined, or (2) multiple rounds of screening and recombination are performed on 
relatively small populations where additional building blocks are gradually introduced 

20 through a process that is analogous to genetic mutation. In this second approach, 
many rounds of selection, recombination and building block introduction are required 
to identify the optimal building block combinations in analogy to the many rounds of 
selection, reproduction and mutation that are required in the evolution of living 
organisms. Thus, the use of genetic algorithms is currently limited because of the 

25 large amount of time required for compound preparation and screening, wherein the 
goal of new drug discovery is to identify a potent compound as quickly as possible. 

Another recently reported approach for identifying high affinity ligands for 
molecular targets of interest is by determining structure-activity relationships from 
nuclear magnetic resonance analysis, i.e., "SAR by NMR" (Shuker et al., Science 

30 274:1531-1534 (1996) and U.S. Patent No. 5,698,401 by Fesik et al.). In this 
approach, the physical structure of a target protein is determined by NMR and then 
small molecule building blocks are identified that bind to the protein at nearby points 
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on the protein surface. Adjacently binding small molecules are then coupled together 
with a linker Th order to obtain compounds that bind to the target protein with higher 
affinity than the unlinked compounds alone. Thus, by having available the NMR 
structure of the target protein, the lengths of linkers for coupling two adjacently 
5 binding small molecules can be determined and small molecule ligands can be 
rationally designed. This approach has been useful for identifying compounds that 
bind to FK506 binding protein with a K d = 20 nM (Shuker et al., supra) and to 
stromelysin with a K d = 15 nM (Hajduk et al., J. Am. Chem. Soc. 119:5818-5827 
(1997) and Hajduk et al., J. Am. Chem. Soc. 119:5828-5832 (1997)). 
) However, while theJSAR by NMR method is powerful, it also has serious 

limitations. For example, the approach requires huge amounts of target protein (>200 
mg) and this protein typically must be l5 N-labeled so that it is useful for NMR studies. 
Moreover, the SAR by NMR approach usually requires that the target protein be 
soluble to >0.3 mM and have a molecular weight less than about 25-30 kDa. 
Additionally, the structure of the target protein is first resolved by NMR, a process 
which often can require a 6 to 12 month time commitment. 

From the above, it is evident that there is a need for novel techniques useful for 
rapidly and efficiently identifying small molecule drug lead compounds that are 
capable of binding with high affinity to a molecular target of interest. We herein 
describe for the first time a method which is based upon pharmacophore 
recombination, wherein a population of small molecule pharmacophores are "pre- 
selected" for the ability to bind to a molecular target and wherein the small molecule 
pharmacophores that bind with the highest affinity are then chemically linked in 
various combinations to provide a library of potential high affinity binding ligands. 
The library of potential binding ligands is then screened using a simple functional 
assay for the presence of one or more compounds that bind to the target molecule with 
very high affinity. 

SUMMARY OF THE INVENTION 

Applicants herein describe a molecular approach for rapidly and efficiently 
identifying small molecule ligands that are capable of binding to a target biological 
molecule with high affinity, wherein ligand compounds identified by the method are 
useful as new small molecule drug lead compounds. The herein described methods 
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allow a library of only the most favorable compounds to be assayed for binding to a 
target biological molecule without the need for screening all possible small molecule 
compounds and combinations thereof for binding to the target as is required in 
standard combinatorial library approaches. More specifically, a library of candidate 
5 target binding fragments is assembled and subjected to a first screen or "pre-screened" 
to identify a subset of that library that bind to a target biological molecule with or 
below a certain dissociation constant. Those candidate target binding fragments 
identified during this "pre-screening" step as being capable of binding to the target 
biological molecule are then coupled or cross-linked in a variety of combinations 
10 using one or more linker elements to provide a library of potential high affinity 
binding ligands or candidate cross-linked target binding fragments, whose building 
blocks represent the small candidate target binding fragments having the highest 
affinity for the target biological molecule as identified in the "pre-screening" step. 
The library of potential ligands or candidate cross-linked target binding fragments for 

15 binding to the target biological molecule is then screened a second time to identify 
those members that exhibit the lowest dissociation constant for binding to the target 
biological molecule. Because the library of candidate target binding fragment 
building blocks is initially "pre-screened" to select for a much smaller set of the most 
favorable building blocks, the most productive building block and cross-linker 

20 combinations can be identified without the laborious task of screening all possible 
combinations of all building blocks coupled together by a set of linkers. The process 
of identifying high affinity drug lead compounds is therefore, greatly expedited. 

With regard to the above, one embodiment of the present invention is directed 
to a method for identifying drug lead compounds that bind to a biological target 

25 molecule of interest, wherein the method comprises the steps of: 

(a) Assembling a library of candidate target binding fragments (CTBF) 
capable of being chemically cross-linked by a cross-linker element to provide 
candidate cross-linked target binding fragments for binding to the target biological 
molecule; 

(b) screening the library of candidate target binding fragments to identify at 
least first and second candidate target binding fragments which bind to the target 
biological molecule; 
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(c) chemically cross-linking the at least first and second candidate target 
binding fragments or structurally related analogs thereof with a cross-linker element to 
provide a library of candidate cross-linked target binding fragments for binding to the 
target biological molecule; and 
5 (d) screening the library obtained in (c) to identify a drug lead compound 

that binds to the target biological molecule. 

In various preferred embodiments, the library of candidate target binding 
fragments may comprise compounds of less than 500 daltons, may comprise simple 
aldehydes, amines, amides, carbamates, ureidos, sulfonamides, alcohols, carboxylic 
10 acids, thiols, aryl halides, alkenes, alkynes, ketones, ethers and/or oximes and/or may 
bind to the target biological molecule with a K„ of 10 mM or lower. In a particularly 
preferred embodiment, the library or candidate target binding fragments may comprise 
oxime compounds, wherein the structurally related aldehyde analogs of those oxime 
compounds are capable of being chemically cross-linked via an O.O'-diamino- 
15 alkanediol cross-linker. Target biological molecules that find use in the described 
methods include, for example, proteins, nucleic acids and saccharides, preferably 
proteins. Preferred TBM's include human or human pathogen proteins, especially 
enzymes, human hormones, human receptors and fragments thereof. These TBM's 
may all contain atoms of naturally occuring isotopic abundance. 
20 In other preferred embodiments, the library of candidate cross-linked target 

binding fragments comprises candidate cross-linked target binding fragments of less 
than about 1000 daltons, that may be homo- or heterodimeric having a K, for the TBM 
of from about 500 uM to about 500 nM or lower. 

Another embodiment of the present invention is directed to a method for 
inhibiting the interaction between first and second biological molecules, wherein the 
method comprises the step of contacting a system comprising both the first and second 
biological molecules with a binding inhibitory amount of a candidate cross-linked 
target binding fragment identified by the above described method, wherein the 
candidate cross-linked target binding fragment binds to one of the first or second 
biological molecules and inhibits their ability to interact. 



25 



30 



- 6 - 



BNSDOCID: <WO 99493 14A1 I > 



WO 99/49314 



PCT/US99/06734 



A further embodiment of the present invention is directed to a drug lead 
compound made by the the method described herein, where the compound is 
represented by the formula: 



TBF m 


— L&r- 


XL 


LGy— 


TBF n 



and 



TBF m -part A TBF n -part C 

LG3 — I XL I LG 4 

TBF m -part B | | TBF n -part D 

where 

TBF m represents a first TBF selected from step (d); 
TBF o represents a second TBF selected from step (d); 

TBF m -part A and B represent TBF m from step (d) where each fragment is 
bonded to a single atom in LG 3 ; 

TBF n -part C and D represent TBF n from step (d) where each fragment is 
bonded to a single atom in LG 4 ; 

XL represents a cross-linker of the formula 
-(C 0 -C 2 -aIkyl-L , -L 2 -L 3 -L 4 -L s -C 0 -C a -alkyI)-; 

LG, and LG 2 are linking groups independently selected from the group - 
C(R>N-CK -0-N=C(R a K -CH^NCR.)-, -N^-CH,-, -C(=0)-N(R.)-, -N(R>C(=OK 
-N(R>C(=0)-0- t -0-C(=0)-N(R>, -N(R>C(=0)-N(R b )-, -N{R±C(=OyN{KD-* - 
SO a -N(R> and -N(R a )-S0 2 -; 

LG 3 and LG 4 are linking groups independently selected from the group >C=N- 
O-, -0-N=C<, -CH 2 -N<, >N-CH 2 -, -C(=0)-N<, >N-C(=0)-, >N-C(=0)-0-, -0-C(=0)- 
N<, >N-C(=0)-N(R b )-, -N(R>C(=0)-N<, -S0 2 -N< and >N-SO r , where < and > 
represent two bonds linking CTBF- part A, B, C, or D to the single N or C atom in 
LG 3 or LG 4 ; 

R a and R h are independently selected from the group 
hydrogen, C,-C I0 -alkyi, C 0 -C 10 -alkyl-C 6 -C to -aryU C 6 -C 10 -aryl-C 0 -C I0 -alkyl, C 0 -C I0 -alkyl- 
heterocycle-C 0 -C l0 -alkyi, C r C 6 -alkyl-NH-C,-C 6 -alkyl, C 0 -C l0 -alky)-O-C 0 -C I0 -alkyI, C 0 - 
C l0 alkyl-C(=O).C 0 -C l0 -alkyl 1 C 0 -C l0 -alkyl-NH-C(-O)-C 0 -C I0 -alkyl, C 0 -C, 0 -alkyl-O- 
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C(=O)-C 0 -C 10 -alkyI, where any alkyl, aryl or heterocycle is optionally substituted 
with C r C 10 -afkyl, C.-C^-alkoxy, C 6 -C l0 -aryl, C 6 -C !0 -aryloxy, halo (F, CI, Br, 1), 
hydroxy, carboxy, amino, nitro and S(O) 03 ; 

TBF m , TBF n , TBF n -part A, TBF m -part B, TBF n -part C and TBF n -part D are 
5 each independently represented by formula I 



-A-(Cycle l)-B-(Cycle 2)-E (I) 

Where 

Cycle 1 and Cycle 2 are independently present or absent and are selected from 
a mono-, bi-, or tricyclic saturated, unsaturated, or aromatic ring, each ring having 5, 
6 or 7 atoms in the ring where the ring atoms are carbon or from 1-4 heteroatoms 
selected from; nitrogen, oxygen, and sulfur, and where any sulfur ring atom may 
optionally be oxidized and any carbon ring atom may form a double bond with O, 

NR n and CR*R* , each ring nitrogen may be substituted with R n and any ring carbon 



10 



20 



25 



30 



may be substituted with R^; 



A and B are independently selected from 
L 3 — L 2 L 1 



■L 4 — L 3 — L 2 — L 1 and 

-L 5 — L 4 L 3 L 2 — L 1 



where: 

L 1 is absent or may be selected from oxo (O), S(0)^ C(=0), C(=N-R n ), 

C(=CR 1 R 1 \ Qr'r 1 '), C(R ! ), C, het, N(R n ) or N; 

L is absent or may be selected from oxo (O), S(0) s , C(=0), C(=N-R n ), 

C(=CR 2 R 2 '), C(R 2 R 2 \ C(R 2 ), C, het, N(R n ) or N; 

L is absent or may be selected from oxo (O), S(0) g , C(=0), C(=N-R n ), 



CC^R^R- 3 ), C(R R ) C(R J ), C, het, N(R n ) or N; 



- 8 - 



BNSDOCID: <WO_9949314A1_l_> 



WO 99/49314 



PCT/US99/06734 



L is absent or may be selected from oxo (O), S(0) s , C(=0), C(=N-R n ), 

C(=CR 4 R 4 \ C(R 4 R 4 ), C(R 4 ), C, NR n or N; and 

L 5 is absent or may be selected from oxo (O), S(0) s , C(=0), C(=N-R n ), 

C(R 5 R 5 ), C(=CR 5 R 5 '), C(R 5 ), C, NR n or N; 

R , R , R , R , R , R , R \ R^ , R J and R^ each are independently 
selected from R a , R a t R° and U-Q-V-W; where s is 0-2 

Optionally, each R*-R 5 or NR n together with any other R*-R 5 or NR n may 

10 

form a mono-, bi-, or tricyclic saturated, unsaturated, or aromatic ring, each ring being 
a homo- or heterocycle having 5, 6 or 7 atoms in the ring, optionally each ring 
containing 1-4 heteroatoms selected from N, O and S where any ring carbon or sulfur 

atom may optionally be oxidized, each ring nitrogen optionally substituted with R n 

1 5 and each ring carbon optionally substituted with R^; 

E is -L^-L^-L^-R 3 ; 

R a is selected from the group; hydrogen, halo(F, CI, Br, I), halo(F, CI, Br, I)- 
C 1 -C 11 alkyl, halo(F, CI, Br, IK^-Cj jalkoxy, hydroxy-Cj-Cjjalkyl, cyano, 

20 isocyanate, carboxy-C^Cj jalkyi, amino, C^-Cj jalkyl-amino-CCj-Cgalkyl), C Q - 

Cj jalkyl-amino-di-CCj-Cgalkyl), aminocarbonyl, C i ~ C i jalkylcarbonylamino, 

carboxamido, carbamoyl, carbamoyloxy, formyl, formyloxy, azido, nitro, hydrazide, 
hydroxamic acid, imidazoyl, ureido, thioureido, thiocyanato, hydroxy, C^-C^alkoxy, 

25 mercapto, suifonamido, het, phenoxy, phenyl, benzyl, benzyloxy, benzamido, tosyl, 
morpholino, morpholinyl, piperazinyl, piperidinyl, pyrrolinyl. imidazolyl and indolyl; 

R a is selected from the group of C 0 -C 1() alkyl-Q-C 0 -C 6 alkyl, C 0 -C 1() alkenyl- 

Q C 0' C 6 a,ky1 ' C 0 -C 10 alkynyl-Q-C 0 -C 6 alkyl, CyC^ jCycloalkyl-Q-CQ-C^lkyl, C 3 - 

M) C 1() cycloalkenyl-Q-C 0 -C 6 alkyl, C r C 6 alkyl-C 6 -C 12 aryl-Q-C 0 -C 6 alkyl, Cg-C^aryl- 

Cj-Cgalkyl-Q-C^CgalkyU C 0 -C 6 alkyl-het-Q-C 0 -C 6 aIkyl, C 0 -C 6 alkyl-Q-het-C 0 - 
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C 6 alkyl, het-C 0 -C 6 alkyl-Q-C 0 -C 6 alkyl, C^Cgalkyl-Q-Cg-C^aryl and Q-C { - 

C^lky, where-any aryl or het is optionally substituted with 1-3 R d and any alkyl, 

alkenyl or alkynyl is optionally substituted with 1-3 R a ; 
a a' 

R and R may join to form a 3-7 member homocyclic ring substituted with 1- 



3R a ; 



R c is selected from hydrogen and substituted or unsubstituted; amino, O-Cj- 
C g alkyl, amino-tCj-Cgalkyl), amino-dKCj-Cgalkyl), Cj-C^alkyl, C 2 -C 1() alkenyl, 
10 c 2 - C io alkyny1, C 3 -C u cyclpalkyl, C 3 -C 1() cycloalkenyl, Cj-Cgalkyl-Cg-C aryl, 
C 6" C 10 aryl " C l" C 6 alky1, c 1 - c 6 a]k y ] ~ het ' het-Cj-Cgalkyl, Cg-C^aryl and het, where 

the substituits on any alkyl, alkenyl or alkynyl are 1-3 R a and the substituents on any 
aryl or het are 1-3 R d ; 
15 R d is selected from R h and R p ; 

R h is selected from the group OH, OCF 3 , OR c , SR m , halo(F, CI. Br, I), CN, 

isocyanate, N0 2> CF 3> C 0 -C 6 alkyl-NR n R n ', C 0 -C 6 alkyl-C(=O)-NR n R n ', C Q - 

20 C 6 alkyl-C(=0)-R a , C r C g alkyl, Cj-Cgalkoxy, C^Cgalkenyl, C^Cgalkynyl, C y 
C 6 cycloalkyl, C^-CgCycloalkenyl, Cj-Cgalkyl-phenyl, phenyl-C^Cgalkyl, Cj- 
C 6 alkyloxycarbonylamino, Cj-Cgalkyloxycarbonyl- C Q -C 6 alkyl, phenyI-C Q - 
C 6 alkyloxy, Cj-C^Ikyl-het, het-Cj-C^lkyl, SO^het, O-Cg-C^aryl, SO -C - 
C 12 aryl ' s °2" C i" C 6 a,k y' and het > wnere any alkyl, alkenyl or alkynyl may optionally 
be sustituted with 1-3 groups selected from OH, halo(F, CI, Br, I), nitro, amino and 
aminocarbonyl, where the substituents on any aryl or het are 1-2 hydroxy, ha!o(F, CI, 
Br, I), CF 3 , Cj-C 6 alkyl, Cj-C^alkoxy, nitro and amino; 

R m is selected from hydrogen, S-Cj-Cgalkyl, C(=0)-C 1 -C ( ,alkyl, C(=0)- 
n n * 

NR R ,C r C 6 alkyl,halo(F, CI, Br, IJ-Cj-C^lkyl, benzyl and phenyl; 
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R n is selected from the group R c , OH, OCF 3> OR°, CN, isocyanate, NH- 

C(=0)-0-R c , NH-C(=0)-R c , NH.C(=0)-NHR c , NH-S0 2 -R s , NH-S0 2 -NH-C(=0)- 
R c , NH-C(=0)-NH-S0 2 -R s , C(=0)-0-R°, C(=0)-R c , C(=0)-NHR C , C(=0)-NH- 
5 C(=0)-0-R°, C(=0)-NH-C(=0)-R c , C(=0)-NH-S0 2 -R s , C(=0)-NH-S0 2 -NHR c , 
S0 2 -R s , S0 2 -0-R°, S0 2 -N(R c ) 2 , S0 2 -NH-C(=0)-0-R°, S0 2 -NH-C(=0)-0-R° and 
SO^-NH-C(=0)-R c ; 

10 R° is selected from hydrogen and substituted or unsubstituted C, -C^alkyl, CL- 

1 6 J 0 

C 6 alkyl-C 6 -C 1() aryl, Cj-C^lkylcarbonyl, C^Cgalkenyl, C 2 -C 6 alkynyl, C 3 - 

CgCycloalkyl and benzoyl, where the substituits on any alkyl are 1-3 R a and the 

j ^ substituents on any aryl are 1-3 RP; 

R p is selected from the group; OH, COOH, COH, NH^ C Q -C 6 alkyl, halo(F, 

CI. Br, I), CN, isocyanate, OR°, SR m , SOR°, N0 2 , CF 3> R c , NR n R n ', N(R n )-C(=0)- 

0-R°, N(R n )-C(=0)-R c , S0 2 -R s , C 0 -C 6 alkyl-SO 2 -R s , C 0 -C 6 alkyl-SO 2 -NR n R n ', 

C(=0)-R c , 0-C(=0)-R c , C(=0)-0-R° and C(=0)-NR n R n ', where the substituits on 
any alkyl, alkenyl or alkynyl are 1-3 R a and the substituents on any aryl or het are 1-3 

R d ; 

„ R s is a substituted or unsubstituted group selected from; C.-CLalkyl, C 0 - 

C g alkenyl, C 2 -C g alkynyl, C 3 -C g cycloalkyI, C 3 -C 6 cycloalkenyl, C Q -C 6 alkyl- C^- 
C 1() aryl, C 6 -C 10 aryl-C 0 -C 6 alkyl, C 0 -C 6 alkyl-het and het-C 0 -C 6 alkyl, where the 

substituits on any alkyl, alkenyl or alkynyl are 1-3 R a and the substituents on any aryl 
30 or het are 1-3 R d ; 

het is any mono-, bi-, or tricyclic saturated, unsaturated, or aromatic ring where 
at least one ring is a 5-, 6- or 7-membered ring containing from one to four 
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heteroatoms selected from the group nitrogen, oxygen, and sulfur, the 5-membered 
ring having from 0 to 2 double bonds and the 6- or 7-membered ring having from 0 to 
3 double bonds and where any carbon or sulfur atoms in the ring may optionally be 
oxidized, and where any nitrogen heteroatom may optionally be quaternized and where 

5 any ring may contain from 0-3 R^; 

U is an optionally substituted bivalent radical selected from the group; C^- 

C^alkyl, C 0 ~C^alkyl-Q, C^C^alkenyl-Q, and C^C^alkynyl-Q, where the substituits 

on any alkyl, alkeny! or alkynyl are 1-3 R ; 

10 Q is absent or is selected from the group; -O-, -S(O) -SO -N(R n )-, -N(R n )-, 

s £ 

-N(R n )-C(=OK -N(R n )-C(=0)-0-, -N(R n )-S0 2 -, -C(=0)-, -C(=0)-0-, -het-,-C(=0)- 



N(R )-, -PO(OR )0- and -P(0)0-, where s is 0-2 and the heterocyclic ring is 

1 5 substituted with 0-3 R ; 

V is absent or is an optionally substituted bivalent group selected from Cj- 

C^alkyl, C^-CgCycIoalkyl, CQ-C^alkyl-Cg-C^aryl, and C^-C^alky-net, where the 

a d 
substituits on any alkyl are 1-3 R and the substituents on any aryl or het are 1-3 R ; 

20 W is selected from the group; hydrogen, -OR°, -SR m , -NR n R n ', -NH-C(=0)- 

0-R°, -NH-C(=0)-NR n R n ', -NH-C(=0)-R c , -NH-S0 2 -R s , -NH-SC» 2 -NR n R n ', -NH- 

S0 2 -NH-C(=0)-R c , -NH-C(=O)-NH-S0 2 -R s , -C(=0)-NH-C(=0)-0-R°, -C(=0)-NH- 

25 C(=0)-R c , -C(=0)-NH-C(=0)-NR n R n ', -C(=0)-NH-S0 2 -R s , -C(=0)-NH-S0 2 - 



NR R , -C(=S)-NR n R" , -SO^R 5 ", -SCyO-R , -SC» 2 -NR n R n , -SC» 2 -NH-C(=0)-0- 
R°, -S0 2 -NH-C(=0)-NR n R n \ -S0 2 -NH-C(=0)-R c , -0-C(=0)-NR n R n ', -0-C(=0)- 

R c , -0-C(=0)-NH-C(=0)-R c , -0-C(=0)-NH-SO -R s and -O-SO -R s ; 

Optionally, TBF m -part A together with TBF ro -part B and TBF n -part C together 
with TBF n -part D may independently form Cycle 1 substituted with -B-(Cyc!e 2)-E. 
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A drug lead precursor or intermediate of this invention is represented by C 0 -C 2 - 

alkyl-L'-L 2 -L 3 -L 4 -L 5 -C 0 -C 2 -alkyI where L 1 through L 5 are defined above. Additional 
embodiments of the present invention will become evident to the ordinarily skilled 
artisan upon a review of the present specification. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a synthetic reaction wherein an aldehyde is reacted with O- 
methyl hydroxylamine to produce an O-methyl oxime compound. 

Figure 2 shows a variety of organic aldehyde pharmacophore molecules which 
after conversion to )-methyl oximes were identified as being capable of inhibiting the 
interaction between CD4 and gpl20. 

Figure 3 shows chemistry useful for chemically coupling two aldehydes via an 

0. 0'-diamino-alkanediol linker to produce both heterodimeric and homodimeric 
oxime compounds. 

Figure 4 shows a variety of aldehyde pharmacophores found to be highly 
efficient in dimeric form for inhibiting the interaction between CD4 and gpl20. 

Figure 5 shows an O-methyl oxime compound found to be particularly 
effective in dimeric form for inhibiting the interaction between CD4 and gpl20. 

Figure 6 shows a specific ligand having particularly high activity for inhibiting 
the interaction between CD4 and gpl20. 

Figure 7 shows a synthetic reaction wherein an aldehyde is reacted with a 
dimethylamine in the presence of support-bound triacetoxyborohydride to give rise to 
an N,N-dimethylamine organic compound. 

Figure 8 shows a chemical synthesis sequence resulting in the production of a 
ligand of the present invention. 

Figure 9 shows a flow diagram for the fragment assembly sequence resulting in 
the production of a drug lead compound of the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 

1. General Description 
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The present invention provides a rapid and efficient method for identifying 
small molecule candidate target binding fragments or ligands that are capable of 
binding with high affinity to target biological molecules (TBM) of interest. The 
compounds identified by the subject method find use, for example, as drug lead 
5 compounds for the development of novel therapeutic drugs. The subject method 
involves the assembly of a library of small organic candidate target binding fragments 
which are capable of being chemically cross-linked via a linker element. The library 
of organic candidate target binding fragments may be "pre-screened" in order to 
identify members of the library that are capable of binding to a target biological 
10 molecule. At least a portion pf the small organic candidate target binding fragments 
identified during the "pre-screen" as being capable of binding to the target are then 
chemically cross-linked to one another in various combinations to provide a library of 
potential or candidate cross-linked target binding fragments for binding to the target 
biological molecule. The library of candidate cross-linked target binding fragments 
15 having potential to be high affinity antagonists (or agonists) are then screened to 
identify drug lead compounds that bind to the target biological molecule with high 
affinity. The step of "pre-screening" the library of small organic candidate target 
binding fragments to identify those that are capable of binding to the target allows one 
to limit the library of potential binding ligands to only those that are comprised of the 

20 most favorable organic candidate target binding fragment building blocks, thereby 
decreasing the required complexity of the library of potential ligands while increasing 
the probability of identifying a molecule exhibiting high binding affinity for the 
biological target molecule. 

One embodiment of the present invention is directed to a method for 

25 identifying a drug lead compound that binds to a target biological molecule of interest. 
The subject method involves assembling a library of organic candidate target binding 
fragment(s) that are capable of being chemically cross-linked via a chemically 
compatible cross-linker to provide candidate cross-linked target binding fragment(s) 
for binding to the target biological molecule. In this regard, the phrase "assembling a 

-™ library of organic candidate target binding fragments" is to be construed broadly and is 
intended to encompass all means by which one may obtain a library comprising two 
or more organic compounds which include, for example, obtaining such compounds 
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from a commercial or non-commercial source, synthesizing such compounds using 
standard chemical synthesis technology or combinatorial synthesis technology (see 
Gallop et al. (1994), supra, Gordon et al. (1994), supra, Czarnik and Ellman (1996), 
supra, Thompson and Ellman (1996), supra and Balkenhohl et al. (1996) supra) and 
S obtaining such compounds as degradation products from larger precursor compounds, 
e.g. known therapeutic drugs, large chemical molecules, and the like. 

The candidate target binding fragments (CTBF) and are, for the most part, 
small water soluble organic molecules that have one or more chemically reactive 
functionalities, referred to as linkable (or linkage) functional groups (LFG) (or sites 
10 that may be readily converted to a chemically reactive functionality using standard 
technology (BLFG)) that provide a site for coupling to another compound or candidate 
target binding fragment via a chemically compatible cross-linker. Thus, the candidate 
target binding fragments of the present invention are capable of being chemically 
coupled to one another via a cross-linker to provide candidate cross-linked target 

15 binding fragments for binding to the target biological molecule, meaning that the 
candidate target binding fragment compounds have a reactive functionality, or a site 
that can be readily chemically converted to a reactive functionality, where a 
chemically compatible cross-linker may covalently attach thereto, thereby allowing 
multimerization of the candidate target binding fragments through the cross-linker. 

20 "Ligands", "candidate ligands' or "candidate cross-linked target binding fragments " 
for binding to a "target biological molecule" for purposes herein are compounds that 
are obtained from reacting two or more organic compounds, which may be the same or 
different, preferably different, with one or more cross-linker so as to produce a 
molecule comprising two or more target binding fragments and one or more cross- 

25 linker. Such ligands are referred to herein as candidate cross-linked target binding 
fragments(CXL-TBF). 

Candidate target binding fragments having the linkage functional group 
modified or blocked so that it contains substantially the same linking group as is found 
in the candidate cross-linked binding fragments are sometimes referred to as 

•"W monomers. Monomers or monomelic compounds that find use in the present invention 
include, for example, aldehydes, ketones, oximes, such as O-alkyl oximes, preferably 
O-methyl oximes, hydrazones, semicarbazones, carbazides, primary amines, secondary 
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amines, such as N-methylamines, tertiary amines, such as N,N-dimethylamines, N- 
substituted hydrazines, hydrazides, alcohols, ethers, thiols, thioethers, thioesters, 
disulfides, carboxylic acids, esters, amides, ureas, carbamates, carbonates, ketals, 
thioketals, acetals, thioacetals, aryl halides, aryl sulfonates, alkyl halides, alkyl 
5 sulfonates, aromatic compounds, heterocyclic compounds, anilines, alkenes, alkynes, 
diols, amino alcohols, oxazolidines, oxazolines, thiazolidines, thiazolines, enamines, 
sulfonamides, epoxides, and aziridines, and the like, all of which have chemically 
reactive functionalities (or are directly prepared from precursor compounds that have 
chemically reactive functionalities) capable of linking, either directly or indirectly, to a 
10 cross-linker . In fact, virtuaUy any small organic molecule that is capable of being 
chemically coupled to another small organic molecule may find use in the present 
invention with the proviso that it is sufficiently soluble in aqueous solutions to be 
tested for its ability to bind to a target biological molecule. 

The above described monomers or candidate target binding fragments will 
15 serve as the individual building blocks for candidate cross-linked binding fragments 
prepared therefrom. Candidate target binding fragments that find use herein will 
generally be less than about 2000 daltons in size, usually less than about 1500 daltons 
in size, more usually less than about 750 daltons in size, preferably less than about 500 
daltons in size, often less than about 250 daltons in size and more often less than about 

20 200 daltons in size, although organic molecules larger than 2000 daltons in size will 
also find use herein. Candidate target binding fragments that find use may be 
employed in the herein described method as originally obtained from a commercial or 
non-commercial source (for example, a large number of small organic chemical 
compounds that serve as candidate target binding fragments are readily obtainable 

25 from commercial suppliers such as Aldrich Chemical Co., Milwaukee, WI and Sigma 
Chemical Co., St. Louis, MO) or may be obtained by chemical synthesis. Examples of 
the latter include the preparation of a library of organic oxime compounds from a 
single step condensation of commercially available aldehydes with O-alkyl 
hydroxylamine as described in Example I below and the preparation of a library of 

™ N,N-dimethylamine candidate cross-linked target binding fragments from the 
reductive amination of commercially available aldehydes and dimethylamine using 
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support-bound triacetoxyborohydride as described in Example II below and by Kaldor 
et al., Tetrahedron Lett. 37:7193-7196 (1996). 

Libraries of candidate target binding fragments or candidate cross-linked 
target binding fragments which find use herein will generally comprise at least 2 
5 organic compounds, often at least about 25 different organic compounds, more often 
at least about 100 different organic compounds, usually at least about 500 different 
organic compounds, more usually at least about 1000 different organic compounds, 
preferably at least about 2500 different organic compounds, more preferably at least 
about 5000 different organic compounds and most preferably at least about 10,000 or 
1 o more different organic compounds. Libraries may be selected or constructed such that 
each individual molecule of the library may be spatially separated from the other 
molecules of the library (e.g., each member of the library is present in a separate 
microtiter well) or two or more members of the library may be combined if methods 
for deconvolution are readily available. The methods by which the library of organic 
1 5 compounds are prepared will not be critical to the invention. 

Once assembled, a library of organic candidate target binding fragments will be 
screened using one of any number of different known assays for the purpose of 
identifying candidate cross-linked target binding fragments that are capable of binding 
to a target biolocical molecule of interest. "Biological target molecules", "target 

20 biological molecules", "target biomolecules", "molecular targets", "biological targets", 
and other grammatical equivalents refer to target biological molecules (TBM) that are 
available (either commercially, recombinantly, synthetically or otherwise) in sufficient 
quantities for use in in vitro binding assays and for which there is some interest for 
identifying a high affinity binding partner. For the most part, target biological 

25 molecules are proteins, including human proteins or human pathogen proteins that 
may be associated with a human disease condition, such as cell surface and soluble 
receptor proteins, such as lymphocyte cell surface receptors, enzymes, such as 
proteases, clotting factors, serine/threonine kinases and dephosphorylases, tyrosine 
kinases and dephosphorylases, bacterial enzymes, fungal enzymes and viral enzymes, 

30 signal transduction molecules, transcription factors, proteins associated with DNA 
and/or RNA synthesis or degradation, immunoglobulins, hormones, receptors for 
various cytokines including, for example, erythropoietin/EPO, granulocyte colony 
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stimulating receptor, granulocyte macrophage colony stimulating receptor, 
thrombopoietiff (TPO), IL-1, IL-2, EL-3, IL-4, IL-5, IL-6, IL-10, IL-11, IL-12, growth 
hormone, prolactin, human placental lactogen (LPL), CNTF, octostatin, various 
chemokines and their receptors such as RANTES, MlPl-a, IL-8, various ligands and 
receptors for tyrosine kinases such as insulin, insulin-like growth factor 1 (IGF-1), 
epidermal growth factor (EGF), heregulin-a and heregulin-(3, vascular endothelial 
growth factor (VEGF), placental growth factor (PLGF), tissue growth factors (TGF-a 
and TGF-{3), other hormones and receptors such as bone morphogenic factors, folical 
stimulating hormone (FSH), and leutinizing hormone (LH), tissue necrosis factor 
(TNF), apoptosis factor- 1 and -2 (AP-1 and AP-2), and proteins and receptors that 
share 20% or more sequence identity to these, and the like, nucleic acids, including 
both DN A and RNA, saccharide complexes, and the like. 

For the step(s) of screening libraries of candidate target binding fragments or 
candidate cross-linked target binding fragments for members having the ability to bind 
to a target biological molecule of interest, a simple ELISA assay may be used to (a) 
identify member(s) of the library that are capable of binding to the target, and (b) 
determine the approximate K d with which the library member(s) bind to the molecular 
target. While ELISA assays are preferred for screening libraries of organic 
compounds, virtually any in vitro assay that allows one to detect binding of the target 
biological molecule by an organic compound may be employed for screening the 
library, wherein such assays include ELISA, other sandwich-type binding assays, 
binding assays which employ labeled molecules such as radioactively or fluorescently 
labeled molecules, fluorescence depolarization, calorimetry, protein denaturation, 
resistance to proteolysis, gel filtration, equilibrium dialysis, surface plasmin resonance, 
X-ray crystallography, and the like. Such assays either measure the ability of library 
members to bind directly to the target biological molecule or are competition binding 
assays designed to measure the ability of library members to inhibit the interaction 
between the target biological molecule and another molecule that binds to the target 
biological molecule. Any of the above assays may be employed to screen libraries of 
candidate compounds to identify those that bind to a target biological molecule. 

For the step of screening a library of candidate compounds to identify those 
that bind to a target biological molecule, it will be well within the skill level in the art 
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to determine the concentration of the library members to be employed in the binding 
assay. For the most part, the screening assays will employ concentrations of candidate 
compounds ranging from about 0.01 to 10 mM, preferably from about 0.05 to 5 mM. 
The step of pre-screening a library of candidate target binding fragments to 
5 identify those that bind to a target biological molecule allows one to identify and 
isolate only those members of the library that have some binding affinity for the target. 
As such, in contrast to standard combinatorial library approaches, the small organic 
building blocks are "pre-screened" to select a smaller set of compounds that have 
some binding affinity for the target. Thus, the most productive organic compound 
10 building blocks can be identified for incorporation into the potential high affinity 
candidate cross-linked target binding fragments that are prepared therefrom, without 
having to screen all possible combinations of all of the initial candidate target binding 
fragment building blocks. Generally, the candidate target binding fragment library 
members selected as building blocks for subsequently prepared candidate cross-linked 
15 target binding fragments are those that have the highest affinity for binding to the 
target biological molecule. For the most part, candidate target binding fragments 
chosen as building blocks for incorporation into the subsequently prepared candidate 
cross-linked target binding fragments are those that bind to the target biological 
molecule with a K, of about lOmM or less, usually about 5mM or less, more usually 
20 about ImM or less, preferably about 500 |LtM or less, more preferably about 100 [iM or 
less and most preferably about 50 JJ.M or less. However, for some applications, one or 
more of the candidate target binding fragment(s) chosen for incorporation into the 
subsequently prepared candidate cross-linked target binding fragments may have an 
individual K d for the target biological molecule of greater than 10 mM. 

25 

Once candidate target binding fragments that bind to a target biological 
molecule with some desired degree of affinity are identified, at least a portion of those 
compounds (or structurally related analogs thereof) are chemically coupled via a cross- 
linker to provide a library of candidate cross-linked target binding fragments for 
binding to the target biological molecule, wherein those candidate cross-linked target 
binding fragments comprise at least one candidate target binding fragment reacted or 
linked with a cross-linker. Usually two or more candidate target binding fragments (or 
structurally related analogs thereof) linked by a cross-linker are combined and in some 



- 19 - 



WO 99/49314 



PCT/US99/06734 



cases these two fragments are the same. The two (or more) candidate target binding 
fragments (or structurally related analogs thereof) incorporated into a candidate cross- 
linked target binding fragment may be the same (i.e., to provide a homodimer or 
homomultimer) or different (i.e., to provide a heterodimer or heteromultimer). Most 
5 commonly, the two candidate target binding fragments in the candidate cross-linked 
target binding fragment are different. 

By "structurally related analog", "analog" and the like, is meant a fragment 
compound that has the same chemical structure as a fragment identified as being 
capable of binding to the target biological molecule except that the analog has a 
10 different chemically reactive functionality or linkage functional group(LFG) for 
binding to the cross-linker than does the fragment that was identified as being capable 
of binding to the target biological molecule in the first or pre-screen. The analog may 
also optionally possess or lack one or more substituents that are either lacking or 
present, respectively, on the fragments identified in the pre-screening provided that 
15 the presence or absence of those substituents does not substantially alter the 
compounds ability to bind to the target. As such, while one may pre-screen a library 
of, for example, candidate oxime compounds to identify candidate oxime fragments 
that bind to the target biological molecule, one can chemically couple not the actual 
oxime compounds identified in the "pre-screening" but rather aldehydes having that 

20 same chemical structures as the oximes identified in the screen (but which have an 
aldehyde reactive functionality rather than an oxime reactive functionality). The 
present invention, therefore, encompasses not only chemical coupling of the actual 
compounds identified in the initial "pre-screening step" (e.g., aldehydes are pre- 
screened and also subsequently linked), but also the chemical coupling of structurally 

25 related analogs of those compounds (e.g., oximes are pre-screened but the analogous 
aldehydes are linked). 

As described above, candidate target binding fragment will comprise a 
chemically reactive functionality or linkage functional group (LFG) (or a site that can 
be converted to a chemically reactive functionality (BLFG)) to which a cross-linker 

30 may be covalently bound, thereby providing a means for cross-linking two or more 
candidate target binding fragments having a LFG ( or analog or blocked form 
thereof) to provide a candidate cross-linked target binding fragment. Therefore, cross- 
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linkers that find use herein will be multifunctional, preferably bifunctional, cross- 
linking molecules that can function to covalently bond at least two fragment 
compounds together via their reactive functionalities or LFG's. Linkers or cross- 
linkers (XL) will have at least one, two or more and preferably only two, chemically 
5 cross-reactive functional groups (CFG) on the cross-linker (XL). The chemically 
cross- reactive functional groups that are available for bonding to two or more 
candidate target binding fragments, wherein those functional groups may appear 
anywhere on the cross-linker, preferably at each end of the cross-linker and wherein 
those chemically cross-reactive functional groups may be the same or different 
1 o depending upon whether the candidate target binding fragment is to be linked have the 
same or different chemically cross-reactive functional groups. Cross-linkers that find 
use herein may be substituted or unsubstituted straight-chain or branched alkyl, aryl, 
alkaryl, heteroaryl, heterocycle and the like. Preferably straight chain alkyl will 
generally be at least one methylene in length, more generally from 2 to 8 methylenes 
1 5 in length, and optionally as many as about 12 or more methylenes or the equivalent in 
length. Cross-linkers may include atoms or groups to increase or improve solubility of 
the library members, such as oxygen atoms interspersed between methylene groups 
creating ether or polyether linkers. Cross-linkers will generally comprise alkyl 
groups either saturated or unsaturated, and therefore, may comprise alkanes, alkenes or 
20 alkynes. Heteroatoms including nitrogen, sulfur, oxygen, and the like may also be 
appended to the alkyl to form groups such as; alkoxyl, hydroxyalkyl or hydroxy 
groups. Other cross-linking groups such as aryl, especially phenylene or substituted 
phenylene linkers are suitably employed. Usually cross-linker elements will be of 
varying lengths, thereby providing a means for optimizing the binding properties of a 
25 cross-linked target binding fragment prepared therefrom. 

In particularly preferred embodiments, cross-linkers may be O.O'-diamino- 
alkanediol compounds, preferably 0,0*-diamino-C,-C t alkanediol, which are useful for 
chemically coupling aldehyde organic compounds, or any of a variety of different 
diamine compounds, which are useful for chemically coupling aldehyde containing 
30 compounds. 

Various chemistries may be employed for chemically coupling candidate 
target binding fragments via a cross-linker to provide candidate cross-linked target 
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binding fragments for binding to a target biological molecule. For example, many 
well known chemistries that can be employed for chemically coupling candidate target 
binding fragments(s) via a linker to form candidate cross-linked target binding 
fragments include, for example, reductive aminations between aldehydes and ketones 
5 and amines (March, Advanced Organic Chemistry, John Wiley & Sons, New York, 
4th edition, 1992, pp.898-900), alternative methods for preparing amines (March et al., 
supra, p. 1276), reactions between aldehydes and ketones and hydrazine derivatives to 
give hydrazones and hydrazone derivatives such as semicarbazones (March et al., 
supra, pp.904-906), amide bond formation (March et al., supra, p. 1275), formation of 

10 ureas (March et al., supra, p.4299), formation of thiocarbamates (March et al., supra, 
p.892), formation of carbamates (March et al., supra, p. 1280), formation of 
sulfonamides (March et al., supra, p. 1296), formation of thioethers (March et al., 
supra, p. 1297), formation of disulfides (March et al., supra, p. 1284), formation of 
ethers (March et al., supra, p.1285), formation of esters (March et al., supra, p.1281), 

1 5 additions to epoxides (March et al., supra, p.368), additions to aziridines (March et al., 
supra, p.368), formation of acetals and ketals (March et al., supra, p.1269), formation 
of carbonates (March et al., supra, p.392), formation of enamines (March et ah, supra, 
p. 1284), metathesis of alkenes (March et al., supra, pp.1 146-1 148 and Grubbs et al., 
Acc. Chem. Res. 28:446-452 (1995)), transition metal-catalyzed couplings of aryl 

20 

halides and sulfonates with alkenes and acetylenes (e.g., Heck reactions) (March et al., 
supra, pp.7 17- 178), the reaction of aryl halides and sulfonates with organometallic 
reagents (March et al., supra, p.662), such as organoboron (Miyaura et al., Chem. Rev., 
95:2457 (1995)), organotin, and organozinc reagents, formation of oxazolidines (Ede 
et al., Tetrahedron Letts. 38:7119-7122 (1997)), formation of thiazolidines (Patek et 

25 

al., Tetrahedron Letts. 36\2221-Tn§ (1995)), amines linked through amidine groups 
by coupling amines through imidoesters (Davies et al., Canadian 7. Biochem. 50:416- 
422(1972)), and the like. 

The step of chemically cross-linking, via a cross-linker, at least a portion of the 
candidate target binding fragments identified as described above as being capable of 
binding to the target biological molecule or structurally related analogs thereof 
provides a library of candidate cross-linked target binding fragments for binding to the 
target molecule that comprise at least two of the candidate target binding fragments 
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or analogs thereof and the cross-linker. As previously stated the candidate target 
binding fragments incorporated into candidate cross-linked target binding fragments 
may be the same, thereby providing a homomultimer, or different, thereby providing a 
heteromultimer, and libraries of candidate cross-linked target binding fragments 
5 generally comprise both homo- and hetero-multimers. Candidate cross-linked target 
binding fragments for binding to the target molecule are preferably dimeric, however, 
candidate cross-linked target binding fragments that find use may also be trimeric, 
tetrameric, and the like, those compounds being obtained by employing cross-linkers 
having more than two chemically cross-reactive functional groups for cross-linking 
10 purposes. Candidate cross-linked target binding fragments for binding to a target 
biological molecule that find use herein will generally be less than aboutlOOO daltons 
in size and often less than about 750 daltons in size. 

Libraries of candidate cross-linked target binding fragments for binding to the 
target biological molecule will generally comprise at least 1 candidate cross-linked 
15 target binding fragment, usually at least about 20 different candidates, more usually at 
least about 100 different candidates, preferably at least about 200 different candidates, 
more preferably at least about 500 different candidates, most preferably at least 1,000 
different candidates and often 10,000 or more. Libraries of candidate cross-linked 
target binding fragments may be constructed such that each individual molecule of the 
20 library may be spatially separated from the other molecules of the library (e.g., each 
member of the library is in a separate microtiter well) or two or more members of the 
library may be physically combined if methods for decon volution are readily available. 

Once obtained, libraries of candidate cross-linked target binding fragments for 
binding to the target biological molecule will be screened for the purpose of 
25 identifying a member(s) of the library that is/are capable of binding to the target 
biological molecule with high affinity. For such purposes, any of the above described 
screening assays can be employed, wherein preferably a biological assay such as an 
ELISA assay is employed. 

For the step of screening a library of candidate cross-linked target binding 
fragments to identify one or more that bind to a target biological molecule, it will be 
well within the skill level in the art to determine the concentration of the compounds to 
be employed in the binding assay. We have herein found that candidate cross-linked 
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target binding fragments generated by chemically coupling organic compounds that 
bind to a target' biological molecule often exhibit surprisingly high binding affinities 
for the target. For the most part, candidate cross-linked target binding fragments that 
serve as potential drug lead compounds or may have therapeutic efficacy on their own 
5 bind to the target biological molecules with a K d of about 500 nM or less, usually 
about 100 nM or less, more usually about 50 nM or less. However, for various 
applications, one or more drug lead compound(s) having an individual K, for the target 
biological molecule of greater than 500 nM may also find use. 

Another embodiment of the present invention is directed to a method for 
10 inhibiting the interaction between first and second biological molecules which bind to 
each other, wherein the method comprises contacting a system comprising those 
molecules with a binding inhibitory amount of a candidate cross-linked target binding 
fragment or drug lead compound identified by the method described above, wherein 
the drug lead compound or cross-linked target binding fragment binds to the first 
15 biological molecule and inhibits its ability to bind to the second biological molecule. 
For the most part, the first and second biological molecules will be proteins, nucleic 
acids, saccharide complexes, and the like, preferably at least one being a protein, more 
preferably both being proteins. In particularly preferred embodiments, the first or 
second biological molecule may be CD4 or gp!20. In other preferred embodiments, 
20 the first biological molecule may be a protein wherein the second biological molecule 
is a receptor for that protein, a nucleic acid, either DNA or RNA, that binds to that 
protein or a polysaccharide or the first biological molecule is an enzyme wherein the 
second biological molecule is a substrate for that enzyme. 

Systems that comprise both the first and second biological molecules may be 
25 either in vitro or in vivo, wherein the first and second biological molecules are situated 
such that they are capable of binding to one another. For in vivo applications, the lead 
of interest may be administered on its own or in pharmaceutically acceptable media, 
for example normal saline, PBS, etc. The additives may include bactericidal agents, 
stabilizers, buffers, or the like. In order to enhance the half-life of the drug lead 
30 compounds in vivo, the compounds may be encapsulated, introduced into the lumen of 
liposomes, prepared as colloids, or another conventional technique may be employed 
that provides for an extended lifetime thereof. 



- 24 - 



BNSDOCID: <WO 9949314A1 J_> 



WO 99/49314 



PCT/US99/06734 



The drug lead compounds may be administered as a combination therapy with 
other pharmacologically active agents or may physically linked to such agents or other 
carriers. Various methods for administration may be employed. The drug lead 
compounds may be given orally, or may be injected intravascularly, subcutaneously, 
5 peritoneally, etc. A "binding inhibitory amount" of the drug lead compounds will 
vary widely, depending upon the nature of the first and second biological molecules, 
the frequency of administration, the manner of administration, the clearance of the 
compound from the host, and the like. Appropriate binding inhibitory amounts may 
be determined empirically by those skilled in the art in a routine manner. 
1 0 candidate cross-linked target binding fragments 

II. Definitions and preferred embodiments 

In its broadest embodiment, the method of this invention comprises: 
assembling a library of candidate target binding fragments; screening the library of 

15 candidate target binding fragments for those that bind to a target molecule; cross- 
linking target binding fragments to produce a library of candidate cross-linked target 
binding fragments; screening the library of candidate cross-linked target binding 
fragments for those that bind to the target molecule. The product of this method is 
referred to as a lead pharmaceutical or drug candidate. 

2° More specifically (see Figure 9), the method of this invention is used to 

identify lead pharmaceutical drug candidates and optionally involves the following 
simple steps (a-h). 

(a) Assembling a library of candidate target binding fragments each 
fragment having a linkable (or linkage) functional group (LFG) or the blocked 
~ 5 form thereof (BLFG), the blocked form containing linking group (LG); 



30 



CTBF { — LFG 

Assembling a library as used herein means any method of selecting two or more 
molecules to form a library for use in the method of this invention. Preferably the 
library will be large, greater than 50 members, and contain a diverse array of target 
interactive groups capable of forming non-covalent bonds, e.g. hydrogen, Van der 



- 25 - 



BNSDOCID: <WO 99493 14A1 I > 



WO 99/49314 



PCT/US99/06734 



Waals, electrostatic, hydrophobic and the like with the target molecule. Such 
interactive groups may include functional groups found on naturally occurring amino 
acid sidechains, carbohydrates, lipids, nucleic acids and their metabolites and 
derivatives thereof, or groups found on known pharmaceuticals. Optionally, a library 
5 may be customized to contain interactive groups known or suspected to interact with 
binding sites on the biological target molecule or its biological ligand. 

Candidate target binding fragments (CTBF) are small water soluble organic 
molecules having a molecular weight of about 200da (including the LFG) capable of 
forming a non-covalent complex with a target biological molecule (TBM). This 

10 complex may be of low affinity having a Kd as low as about 5mM. The CTBF's are 
commercially available or may be synthesized by known procedures. 

Linkable (or linkage) functional groups (LFG) include any functional groups 
capable of reacting with a chemically cross-reactive functional group (CFG) on a 
cross-linker (XL) thereby forming a stable covalent bond with the cross-linker. This 

15 covalent bond is referred to simply as a linking group (LG). When the linked 
molecule contains more than one linking group an integer following LG is used to 
indicate the number of the LG in the molecule. The LFG includes blocked, protected 
or otherwise transformed groups that may or may not react directly with the CFG on 
the cross-linker. This blocked form of the linkable functional group (BLFG) is often 
20 the preferred form of the CTBF because it may be less likely to form covalent linkages 
with the target biological molecule in the contacting step below. 



In the case where the BLFG is to be used in the contacting step, the de-protected 
or re-transformed form of the CTBF that it is capable or reacting with the CFG on the 
cross-linker is the form used in the cross-linking step below. By way of illustration, a 
CTBF may contain an aldehyde as a LFG or this aldehyde may be protected or 
transformed by reacting it with an O-amino alcohol to form an oxime (BLFG) as 
shown below. 



CTBF —BLFG 



30 




p-R 2 
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In this case the oxime would be considered the BLFG. It will be appreciated that 
more than one reaction or transformation to an LFG may be made. By way of further 
illustration an aldehyde may be reacted with an amine to form a Schiff base, which in 
turn may be reduced to a secondary amine. This may be still further reacted to form an 
5 amide, sulfonamide, urea, carbamate etc. All these transformations of the initial 
aldehyde are also considered BLFG's. 

In the case where the CTBF (including the LFG) is first transformed or 
protected, the initial CTBF is sometimes referred to as a "precursor" while the 
transformed or protected form that is contacted with the TBM is referred to as a 
10 "monomer". Thus in the reaction illustrated above, the aldehyde may be called a 
precursor while the oxime is referred to as a monomer. In this case the oxime covalent 
bond (=N-0-) is referred to as the linking group (LG). It is often preferred that the 
monomer contain the same linking group (LG) as is present in the cross-linked form, 
described below, because some of the binding energy with the target may come from 
15 LG. 

Examples of linkage or linkable functional groups include; aldehyde, ketone, 
primary amine, secondary amine, epoxide, carboxylic acid, sulfonic acid, alcohol 
(hydroxyl), isicyanate, isothiocyanate, halide and sulfonate. These fuctional groups 
may act as precursors of the blocked linkage functional groups. 
10 Examples of CTBF's having blocked linkage or linkable functional groups are 

molecules containing Linkage Groups selected from; oxime, hydrazone, N-acyl 
hydrazone, secondary amine, tertiary amine, acetal, ketal, 1,2 amino alcohols, amide, 
N,N-disubstituted amides, thioamide, ureido, thioureido, carbamate, thiocarbamate, 
thiothiocarbamate, sulfonamide, carbamate, guanidino, amidino, thioester, ester, ether, 
2-hydroxyether, 2-hydroxythioether, thioether, disulfide, alkane (alkylene), alkene 
(alkenylene) and alkyne (alkynylene). Prefered monomers will contain the above 
functional groups as LG's. 

Steps b-d below may be combined into a single screening step that may be referred 
to as a first screening step, a pre-selection step or a pre-screening step. 

0 

(b) Contacting the candidate target binding fragments with a target 
biological molecule (TBM), 
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Contacting of the TBM with one or more members of the library may be 
conducted either individually or multiply. Preferably each candidate target binding 
fragment is contacted individually with the TBM. For example, this may conveniently 
be done in a 96 well format plate so that the formation of a complex with each member 
5 of the library can be conveniently evaluated without requiring any deconvolution step. 
The contacting step is often conducted at relatively high concentrations of the CTBF, 
so that Kd's as low as about 5 mM can be measured (see below). 

The Target Biological Molecule (TBM) may be any biological molecule 
preferably of mammalian and most preferably of human origin. Optionally preferred 
TBM's may be human pathogen proteins such as viral proteins from viruses that infect 
human cells. Preferred TBM's are proteins most preferably secreted proteins. 
Preferred secreted proteins include; enzymes, cytokines, hormones, growth factors and 
their receptors. The TBM's may be isolated from natural sources or made 
recombinately in a host cell. Normally the atoms of the TBM will contain the natural 
abundance isotopes, but in some circumstances may be enriched. When the target is a 
receptor or a cell surface bound molecule, the TBM may conveniently be the 
extracellular domain or a derivative thereof. 

In a preferred embodiment of the method, the Target Biological Molecule is 
not a single biological molecule; rather it is two or more molecules. For example, 
preferred TBM's may be protein-protein, protein-DNA/RNA, protein -substrate pairs. 
By way of illustration, a ligand-receptor pair may be the actual target when an ELISA 
or other biological assay requiring two or more biological molecules is used to 
measure a physical association (see below) such as binding or activity. The binding 
constant, IC50 or other measurement may result from the CTBF binding with either 
the Ligand, receptor or both. Similarly, when selected fragments are cross-linked (see 
below) and re-screened against the ligand-receptor pair, the recombined fragments 
may bind with the ligand, receptor or both. 

(c) Measuring a change in a first physical association (PA-1) of the 
target biological molecule. 
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Measuring a change as used herein means any method capable of quantifying a 
physical association, including binding, of the CTBF with the TBM. 
A physical association (PA-1) of the target molecule includes a biological property 
such as binding with another biological molecule, signal transduction or catalysis of a 
5 reaction. It may also include any measurable physical chemical property such as a 
spectroscopic or magnetic property. Preferably the physical association will be 
suitable for rapid high throughput screening. The most preferred physical association 
measurement will be a biological one such as measuring binding or catalysis. An 
example of binding measurement would be an ELISA assay where protein-protein 
10 antagonism is measured. 

(d) Selecting target binding fragments (TBF) based on (c). 

Selecting target binding fragments (TBF) is based on the physical association 
1 5 measurement step. Selected TBF's will include those that bind relatively weakly with 
the TBM. Thus, for example in a binding assay such as ELISA, fragments that bind 
with no greater than a 5 mM affinity may be selected for. Most commonly the first 
selected CTBF's will bind to the target with an affinity of from 2 mM to 100 uM. 
TBF's or monomers that bind with a higher affinity e.g. Kd < 50 U.M are preferred, 
however such relatively high binding affinities are not necessary for selection for the 
cross-linking step. 



20 



(e) Reacting target binding fragments with a cross-linker, having 
25 chemically compatible cross- reactive group(s) with the LFG, under conditions 
suitable for forming a library of candidate cross-linked target binding fragments 
(CXL-TBF). 

The selected target binding fragments, which individually normally bind to the 
30 target with relatively low affinity, are then cross-linked, normally in all combinations 
and permutations, to produce a library of candidate cross-linked target binding 
fragments (CXL-TBF). By way or illustration, If TBF,,, and TBF are selected from 
the first screen, these molecules are then reacted with a suitable cross-linker such as a 
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bifunctional linker (BFL) to form candidate cross-linked target binding fragments 
(CXL-TBF) for the second screening step according to the general equation: 



10 



15 



20 



25 



.10 



TBF m 


— LFG + 


TBF n 


—LFG + 


BFL 



CXL-TBF 



The BFL above will normally have two cross-reactive groups that are 
compatible with both LFG's, that is they will form stable covalent bond(s) (LG) with 
the LFG's under selected reaction conditions. 

Each TBF may consist of two (or three) parts or moieties when LFG contains 
an atom capable of forming two (or three) bonds other than the linking group (LG) 
bond formed with the cross-linker. For example, when LFG is the carbonyl of a 
ketone, the carbonyl carbon may be bonded to two alkyl, aryl etc parts or moieties 
(part A and part B or part C and part D). This case may be represented generally by 
the diagram below where, for example, two ketones are cross-linked with an 0,0'- 
diamino-alkanediol cross-linker to form a di-oxime: 



TBF m -part A 



TBF n -part C 



TBF m -part B 



LGg — [ XL | LG 4 

J J TBF n -part D 



Here each of the two alkyl, aryl etc parts or moieties from the ketone are bonded to a 
nitrogen atom in the linking group (LG, =N-0-). 

The CXL-TBF's are sometimes referred to as "dimers", however, they are truly 
only dimers when the cross-linking moiety (XL) is only a chemical bond. In the most 
general case these "dimers" will contain a linking moiety, such as an alkane, alkene, 
arylene, alkyl ether and the like bonded through one or more linking groups (LG) to 
the corresponding TMF's. 

Reacting means chemically reacting so that a stable covalent bond or linking 
group LG is formed as the reaction product between the linkable functional group on 
the target binding fragment(s) and a chemically cross-reactive functional group (CFG) 
on the cross-linker. This step may be referred to as a cross-linking step or sometimes a 
"combination" or "recombination" step. 
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15 



20 



25 



30 



Compatible functional group herein means capable of reacting with to form a 
stable LG. 

The cross-linker is a small organic molecule having at least one cross-reactive 
functional group capable of reacting with the linkable functional group of at least one 
of the target binding fragments. Commonly, the cross-linker will have 2-4 such cross- 
reactive functional groups and most commonly 2 such groups. In this case the cross- 
linker is referred to as a bifunctional linker (BFL), either homo-bifunctional or hetero- 
bifunctional depending on whether the compatible functional groups are the same or 
different from one another. 

In the case where the cross-linker has a single cross-reactive functional group, 
the candidate cross-linked target binding fragment, CXL-TBF, is simply the reaction 
product of these two molecules: 

' 1 - LG-XL (or simply CXL-TBF) 



TBF n 



-LFG n + 



-CFG! 



CTBF 



A simple example of such a reaction would be the product of an aldehyde and 
an amine. In this exemplary case Rl is a TBF and R2 Is XL where LFG-1 is an 
aldehyde and CFG-1 is a primary amine: 



R 



R, 



H 



H 



In this case LG is the resulting secondary amine, -(NH)-. 

In the case where the cross-linker is a bifunctional linker, BFL, the linker has 
two cross-reactive linking groups and may be represented generally as: 



CFG, — XL 



—CFG, 



In this case, reacting two target binding fragments from the first screen above, each 
with its own linking functional group with the above BFL may be represented: 



TBF m — LFG, 



TBF r 



LFG, 



CFG, 



XL 



—CFG, 



TBF m LGt 



XL 



LGt 



TBF n 



In the most general case of the above reaction, the two target binding fragments 
will have different linkable functional groups and will be linked by a hetero- 
bifunctional linker (het-BFL). In the most common case, both TBF will have the same 
CFG and will be linked by a homo-bifunctional linker (homo-BFL). In some cases, 
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target binding fragments having different linkable functional groups may be linked 
with a homo-BFL and target binding fragments having the same linkable functional 
groups may be linked with a het-BFL. 

Higher order linking groups such as tri- and tetra-functional linkers may also 



5 be employed in the method of this invention. By way of illustration a Afunctional 
linker such as that shown below may be used: 




In each of the above cases, the product of the cross-linking reaction produces a 
candidate cross-linked target binding fragment(s) (CXL-TBF). These candidate 
molecules, typically with a molecular weight of 400 - 600 daltons, are then screened a 
1 5 second time against the target biological molecule. 

Steps f-h below may be combined into a single screening step that may be 
referred to as a second screening step, a selection step or a final screening step. 

(f) Contacting the candidate cross-linked target binding fragments 
20 with the target biological molecule. 

Contacting of the TBM and one or more members of the library of candidate 
cross-linked target binding fragments (CXL-TBF) may be conducted either 
individually or multiply as before. Preferably each candidate cross-linked target 

25 binding fragment is contacted individually with the TBM. The contacting step is 
usually conducted at lower concentrations of the CXL-TBF compared to CTBF's in 
the first screening step, so that Kd's in the micromolar or nanomolar range can be 
measured. The contacting format may be the same or different from that in the first or 
pre-screening step. Thus for example the two contacting steps may both be part of a 

30 binding assay (e.g. ELISA) or, for example, the second contacting step may be a 
functional aclivity assay or cell based binding assay. 

(g) measuring a change in a second physical association (PA-2) of the 
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target biological molecule. 

The second physical association (PA-2) measurement for the CXL-TBF's may be 
the same as that used for the CTBF's or may be a different physical measurement. 
5 Preferably the second physical association measurement will be a biological 
measurement rather than a physical chemical measurement such as spectroscopic or 
the like. 

(h) selecting cross-linked target binding fragments (XL-TBF) based of 

(8). 

10 

The cross-linked target binding fragments (XL-TBF) selected will typically 
have a Kd, IC-50 or the equivalent of 500 nM or better. These XL-TBF's are useful as 
drug lead pharmaceutical candidate molecules. 

Ill Specific Chemistry 

Many chemistries may be employed to produce linkable functional groups or to 
block or derivitize them. Similarly a wide array or linking chemistries are possible. 
Described below are a number of chemistries suitable for rapid high throughput 
screening. This description is meant to be illustrative and not limiting. In the 
description below the term "CTBF" is equivalent to "CTBF-part A" plus " CTBF-part 
B" defined above. Similarly, Linkable functional group is used interchangeably with 
linkage functional group. 

The term "alky!" means a cyclic, branched or unbranched saturated or 
unsaturated hydrocarbon radical having the number of carbons specified, or if no 
number is specified, up to 12 carbon atoms. 

The term "aryl" means a homocyclic aromatic hydrocarbon radical having 
from 6-14 carbon atoms. Examples include phenyl, napthyl, biphenyl, phenanthrenyl, 
napthacenyl and the like. 

The term "heteroaryl" means a heterocyclic aromatic radical having from 4-13 
carbon atoms and from 1-6 heteroatoms selected from N, O, S and P. 

The term "heterocycle" means a saturated or partially unsaturated cyclic radical 
having from 3-13 carbon atoms and from 1-6 heteroatoms selected from O, S, N and P. 
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The term "alkoxy" means an alkyl radical, as defined above, substituted with 
an oxo radicai^O-)- 

The term "acyl" means alkanoyl or alkylcarbonyl having from 1-12 carbon 

atoms. 

5 The term "carboxy ester" means acyloxy or alkanoyloxy having from 1-12 

carbon atoms. 

The term "carboxamide" means alkylcarbonylamino having from 1-12 carbon 

atoms. 

GENERAL NOTE: When CTBFs are employed with BLFGs, they may be 
10 prepared as described belo^v from CTBFs containing the corresponding LFGs. 
Alternatively, it may be practical to purchase, otherwise acquire, or prepare by known 
methods the CTBFs with BLFGs using alternative chemistry. 

Candidate Target Binding Fragments or Molecules (CTBFs) 

15 

L Aldehyde and ketone as the linkage functional groups and the corresponding 
blocked linkage functional groups 

(a). The linkage functional group is the carbonyl group present in an aldehyde 
or a ketone. These CTBFs may be available commercially, or may be prepared by a 
20 variety of known methods to those practiced in the art. 
Aldehyde CTBFs are represented as follows. 



CTBF — CHO 



Ketone CTBFs are represented, with two different parts (A and B) of the CTBF 
25 attached to ketone carbonyl as shown below. 



30 



CTBF-part A 



CTBF-part B 



(b). Alternatively, the library is assembled with the oxime group as the 
blocked linkage functional group (BLFG). These CTBF's are prepared by 
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condensation of aldehydes or ketones LFG's with an O-substituted hydroxylamine. 
R 8 may be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be 
appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, heterocycle, carboxy 
5 ester, carboxamide, amino, 7V-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 may also be an aryl or heteroaryl group that is optionally substituted 
(alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, W-acylamino, alkoxy, 
hydroxy, mercapto, phosphono). 

/OR 8 

. . . N 

CTBF — CHO + H 2 N-OR 8 ^ CTBF — CH 



15 



or 



CTBF-part A 



4? 



P 



+ HjN-OR 8 



CTBF-part A 



N-OR 8 



CTBF-part B 



CTBF-part B 



20 



(c). Alternatively, the library is assembled with the hydrazone group as the 
BLFG. These CTBFs are prepared by condensation of aldehydes or ketones LFGs 
with an //-substituted hydrazine(March, Advanced Organic Chemistry, John Wiley & 
Sons, New York, 4 th edition, 1992, pp. 904-906). R 8 may be H, or a straight chain or 
branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five 
R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl 
25 or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, //-acylamino, alkoxy, hydroxy, mercapto, phosphono). 

,NHR 8 



CTBF 



.10 



CTBF-part A 



CHO 



H 2 N— NHR 8 



or 



+ H 2 N— NHR 8 



CTBF — CH 



CTBF-part A 



N— NHR 8 



CTBF-part B 



CTBF-part B 
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(d). Alternatively, the library is assembled with the N-acylhydrazone group as 
the BLFG. These CTBF's are prepared by condensation of aldehydes or ketone LFG's 
and an N-acyl hydrazine. Many reaction conditions are known to those practiced in 
5 the art (e.g., [March et al. supra, pp. 905-906] and [Li et al. Chem. Biol, 1: 37 
(1994)]). X may be nothing, O, S, NH or NR 9 . R« or R9 may be H, or a straight chain 
or branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R8 or R* may also be appended with up to 
five R 10 groups (R™ is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
10 acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R« or R? may also be 
an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono). 



15 



20 



CTBF 



}— CHO + ^n^-XR 1 



CTBF 



VSr 1 "' 

I— CH O 



or 



CTBF-part A 



k N^XR 8 * 



CTBF-part B 



CTBF-part A 



H 

o 



XR 8 



| CTBF-part B 



(e). Alternatively, the library is assembled with an amine group as the BLFG. 
These CTBF's are prepared by reductive amination of aldehydes or ketone LFGs. A 
large number of reducing agents could be employed that are known to those practiced 

25 in the art (March, et al. supra, pp. 898-900). R» or R* may be H, or a straight chain or 
branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R« or R* may also be appended with up to 
five Rio groups (R io is alkyl) ary , heteroaryl, carboxy ester, carboxamide, amino, N- 

^ acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R» or R? may also be 
an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino, N-acylamino, alkoxy. hydroxy, mercapto, 
phosphono). 
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CTBF 



-CHO + 



HNR 8 R 9 



reducing agent ^ 



CTBF 



fsIR 8 R 9 
-CH 2 



CTBF-part A 



HNR**R 



8 D 9 



or 



reducing agent 



CTBF-part A 



CTBF-part B 



NR 8 R 9 



CTBF-part B 



(0- Alternatively, the library is assembled with an acetal or ketal group as the 
BLFG. These CTBF's are prepared by condensing the aldehyde or ketone LFG's with 
a diol. Conditions for the preparation of acetals or ketals are known to those practiced 
in the art (March, et al. supra, pp. 889-891). R 8 may be a straight chain or branched 
alkyl group of length 2 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, 
P, S) within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is 
alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, 
hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl 
group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). R 8 may be a straight 
chain or branched alkyl group of length 2 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five 
R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an 
aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF —CHO + 



HO-R^OH 



CTBF h-C-O 
H 



<j>-R 



or 



CTBF-part A 



HO-R*-OH 



CTBF-part A 



O-R 8 

-U> 



| CTBF-part B 



CTBF-part B 



(g). Alternatively, the library BLFG's may be prepared by condensing the 
aldehyde or ketone LFGS with an amino alcohol or an amino thiol. Methods to prepare 
the product oxazolidines and thiazolidines are known to those practiced in the art (e.g., 
oxazolidines: Ede, et al. Tetrahedron Letters, 38: 71 19-7122 (1997), and thiazolidines: 
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Patek et al. Tetrahedron Letters, 36:2227-2230 (1995)). R« or may be a straight 
chain or branched alkyl group of length 2 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five 
R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
5 acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an 
aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). R 9 
may be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 9 may also be 
appended with up to five R 1 * groups (R 11 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 
R 9 may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). 



20 



25 



30 



CTBF — CHO + 



HO— R 8 — NHR 9 



o-? 1 



CTBF — C-NR 9 
H 



or 



CTBF-part A 



HO— R 4 — NHR 9 



CTBF-part A 



O-R 8 
•NR 9 



CTBF-part B 



CTBF-part B 



(h). Alternatively, the library is assembled with the alkene group as the 
blocked linkage functional group (BLFG). These CTBF's are prepared by 
condensation of aldehydes or ketone LFG's with phosphorous ylides (Maryanoff et al. 
Chemical Reviews, 89, 863-927 (1989). R 8 and R' may be H or a straight chain or 
branched alkyl group of length 2 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 and R' may also be appended with up to 
five R 10 groups (R'° is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R* and R* may also be 
an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono, sulphono). R 8 and R 9 may also be halogens and heteroatoms. 
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j CTBF-part A 



CTBF-part A 



CTBF-part A 
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(i). Alternatively, carbanions, usually stabilized carbanions, may also be added 
into the carbonyl. Either the alcohol product A is obtained, or the hydroxyl group is 
eliminated to provide an alkene B. Numerous methods are available for performing 
this transformation (March et al., supra, 937-950). ). R 8 and R* may be H or a straight 
chain or branched alkyl group of length 2 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 and R* may also be appended with up to 
five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 and R 9 may also be 
an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono, sulphono). R 8 and R 9 may also be halogens and heteroatoms. 



25 h .R 8 

CTBF— CHO + C— — R 



base 



T 



i a ^ 

CTBF] — CH 



A R 9 



30 



CTBF-part a] 



H \ C /^ R 9 _^ JCTBF-part A I C 



\ ^CH— R 9 



CTBF-part A 
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2. Primary and secondary amines as the linkage functional groups and the 
correspondingHldcked linkage functional groups . 

(a). The linkage functional group is the basic nitrogen of primary or secondary 
amines. These CTBF's may be available commercially, or may be prepared by a 
variety of known methods to those practiced in the art. CTBF's that have a primary 
amine LFG's are represented as follows. 



CTBF — NH 2 



CTBF's that have a secondary amine LFG's are represented with two different parts 
j 0 (A and B) of the CTBF attached to the amine group as shown below. 



CTBF-part A 



-fjJH 



CTBF-part B 



(b). Alternatively, the library may be assembled as secondary or tertiary amine 
15 BLFGs. 

(i). CTBF's with secondary or tertiary amine B LFG's may be prepared 
by reductive amination of primary amine or secondary amine LFG's, respectively, 
with aldehydes and ketones. A large number of reducing agents could be employed 
that are known to those practiced in the art (March, et al. supra, pp. 898-900). R 8 or 
20 R 9 may be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 or R 9 may also 
be appended with up to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 
R 8 or R 9 may also be an aryl or heteroaryl group that is optionally substituted (alkyl, 
aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). Numerous reductive amination methods may be used that are 
known to those practiced in the art. 



Reducing Agent r- 1 9 



CTBFj-NH 2 + RW "eoucngagen^ | Pharmacophore | — NH 



HR 8 R 



8 D 9 



or 
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Reducing Agent 



CTBF-part A — N 



pHR 8 R 9 



CTBF-part B 



CTBF-part B 



(ii). Alternatively, CTBF's with secondary or tertiary amine BLFG's 
5 may be prepared by reaction of primary or secondary amine LFG's, respectively, with 
an aryl, heteroaryl or alkyl group substituted with a leaving group X, where X may be 
a halide or a sulfonate group (OSO2R where R is substituted or unsubstituted alkyl or 
aryl, e.g. CH3, CF3, phenyl-CH3 and phenyl-NC>2). The halide could be attached to 
aromatic or heteroaromatic functionality on R 8 , or it could be attached to an aliphatic 

^ group on R 8 . When X is substituted upon aromatic and heteroaromatic functionality, 
an SwAr reaction or a palladium-mediated (or related transtion metal mediated) amine 
coupling reaction would be performed [e.g, March, et al. supra, pp. 656-657; Wagaw 
et al., 7. Am. Chem. Soc, 1 19: 8451-8458 (1997) and references therein; and Ahman et 
al. Tetrahedron Letters, 38: 6363-6366 (1997)]. Where X is substituted upon alkyl 

5 functionality, an S^2 or Sjsjl reaction would be performed (March, et al., supra, pp. 
411-413). R 8 may be H, or a straight chain or branched alkyl group of length 1 to 10, 
which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 may also be an aryl or heteroaryl group that is optionally substituted 
(alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, 
hydroxy, mercapto, phosphono). 



»8 



CTBF — NH 2 



X-R 8 



CTBF — NH 



or 



CTBF-part A 



-NH + X-R 8 



CTBF-part B 



CTBF-part A 



A_ 



CTBF-part B 



(iii). Alternatively, CTBF's with secondary or tertiary amine BLFG's 
may be prepared by reaction of primary or secondary amine LFG's, respectively, with 
a substituted epoxide (March, et al., supra, p. 416). Many epoxides are available 
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commercially. Alternatively they can be prepared by a number of known methods to 
those practicedin the art, most preferably by epoxidation of an alkene. R 8 to R 11 may 
be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 to R 11 may 
also be appended with up to five R 12 groups (R 12 is alkyl, aryl, halide, heteroaryl, 
carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono, sulphono). R 8 to R 11 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, halide, heteroaryl, carboxy ester, carboxamide, 
amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 



R 



CTBF 



-NH 2 



R 9 R 11 



CTBF 




-NH 



H 

R 10 

R 11 



or 



CTBF-part A 



— NH 



+ R 9 R" 



CTBF-part A 



CTBF-part B 



CTBF-part B 



(c). Alternatively, the library may be assembled as amide or thioamide 
BLFG's. These CTBF's may be prepared by coupling primary and secondary amine 
LFG's with carboxylic acids (X = H), carboxylic acid derivatives (X = OR, SR, 
halide), or the corresponding thiocarboxylic acid derivatives (X = OR, SR, halide). 
Numerous methods are also available for coupling carboxylic acids and carboxylic 
acid derivatives with amines and are known to those practiced in the art [e.g., March, 
et a!., supra, pp. 417-425; and Fields et al. Int. J. Peptide Protein Res. 35:181-187 
(1990)]. R 8 may be H, or a straight chain or branched alkyl group of length 1 to 10, 
which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
may also be appended with up to five R 9 groups (R 9 is halide, alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is optionally 
substituted (halide, alkyl, aryl, halide, heteroaryl, carboxy ester, carboxamide, amino, 
/V-acylamino, alkoxy, hydroxy, mercapto, phosphono). 



- 42 - 



9949314A1 I > 



WO 99/49314 



PCT/US99/06734 



CTBF 



— NHo + 



CTBF-part A 



— NH 
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CTBF 



-NH 
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or 



CTBF-part A 



CTBF-part S 



CTBF-part B 



(d). Alternatively, the library may be assembled as urea or thiourea BLFG's. 

(i). These CTBF's may be prepared by condensation of primary or 
1 0 secondary amine LFGs and isocyanates or isothiocyanates. The direct coupling of 
isocyanates and isothiocyanates with amines is known to those practiced in the art 
(March, et al., supra, p. 903). R 8 may be H, or a straight chain or branched alkyl 
group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) 
within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is alkyl, 
aryl, heteroaryl, carboxy ester, carboxamide, amino, Af-acylamino, alkoxy, hydroxy, 
mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). 
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CTBF — NH 2 + 



2=C=N— R fi 
Z =0, S 



CTBF 



or 



CTBF-part A 



-NH 



CTBF-part B 



Z = O, S 



CTBF-part A 
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H H 

z = o,s 



CTBF-part B 



(ii). Alternatively, these CTBF's may be prepared by a two step 
process. In the first step, the primary or secondary amine LFG's is converted to 
30 carbamate (thiocarbamates) or related derivatives where X and Y are alkoxy groups, 
mercaptyl groups, halides, or other suitable leaving groups. In the second step a 
primary or secondary amine is added (e.g., Hutchins, Tetrahedron Letters, 35: 4055- 
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4058, (1994) and references therein). R 8 and R 9 may be H, or a straight chain or 
branched alkyl "groups of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 and R 9 may also be appended with up 
to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, 
S /V-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 and R 9 may also 
be aryl or heteroaryl groups that is optionally substituted (alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino Af-acylamino, alkoxy, hydroxy, mercapto, 
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phosphono). 
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(2) 



CTBF-part A 



CTBF-part B 



R 9 



CTBF-part A 



CTBF-part B 



(iii). These CTBF's may also be prepared by condensation of primary 
or secondary amine LFG's and carbamates, thiocarbamates, or related derivatives 
where X is an alkoxy group, a mercaptyl group, or a halide (e.g., Hutchins, 
Tetrahedron Letters, 35: 4055-4058, (1994) and references therein). R 8 and R 9 may 
be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 and R 9 may 
also be appended with up to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy 
10 ester, carboxamide, amino, 7V-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 and R 9 may also be aryl or heteroaryl groups that are optionally 
substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino W-acylamino, 
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alkoxy, hydroxy, mercapto, phosphono). 



CTBF 



-NH, 



z = o,s 



CTBF 



z = o,s 



or 



CTBF-part A 



-NH 



CTBF-part B 



CTBF-part A — N I 



z = o,s 



CTBF-part B 



(d). Alternatively, the library may be assembled as sulfonamide BLFG's. 
These CTBF's may be prepared by coupling primary and secondary amine LFG's with 
sulfonic acids (X = H) or sulfonic acid derivatives (X = OR, SR, halide). Numerous 
methods are available for coupling sulfonic acids and sulfonic acid derivatives with 
amines and are known to those practiced in the art (e.g., March, et al., supra, p. 499, 
Greene, et al. Protective Groups in Organic Synthesis, John Wiley & Sons, New York, 
2 nd edition, 1991, pp. 379-385). R 8 may be H, or a straight chain or branched alkyl 
group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) 
within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is halide, 
alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, /V-acylamino, alkoxy, 
hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group 
that is optionally substituted (halide, alkyl, aryl, halide, heteroaryl, carboxy ester, 
carboxamide, amino, //-acylamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF — NH;> + 



CTBF-part A 



NH 



' Sx R 8 



or 



1 V 

CTBF K n" s -r° 

H 



CTBF-part A 



CTBF-part B 



1 



CTBF-part B 



(e). Alternatively, the library may be assembled as carbamate, thiocarbamate 
or related BLFG's. These CTBF's may be prepared by condensation of primary and 
secondary amine pharmacophores with carbonyl derivatives (X = halide, OR, SR; Y = 
S,Q; Z = S,Q). Numerous methods are known to those practiced in the art (e.g., March, 
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et al., supra, p. 418; and Greene, et al., supra, pp. 315-348). R» ma y be H, or a 
straight chain or-branched alkyl group of length 1 to 10, which may incorporate from 1 
to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to 
five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
5 acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl 
or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino //-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulfono). 



10 



15 



JL 



CTBF t ~ NH 2 + X^YR« 

Z = O.S; Y = S, O 



CTBF 



Z 
11 

-N^YR 8 
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or 



CTBF^Part A 



}-NH 
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CTBF— Part B 



Z = O t S; Y = S t O 



CTBF^-Part B 



(0- Alternatively, the library may be assembled as guanidine BLFGs. These 
CTBFs may be prepared by condensation of primary and secondary amine 
pharmacophores with carbonyl derivatives (X = halide, OS02R, OR, SR) [Roskamp, 
20 et al. Tetrahedron, 53:6697-6705 (1997)]. R 8 may be H, or a straight chain or 
branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five 
R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl 
or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino 7V-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulfono). 
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30 



NR 10 



CTBM — NH 2 + 
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NR 8 R 9 
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CTBM-Fragment A I — NH + X^NR 8 R 9 (CTBM — Fragment A [— N^NR 8 R 9 



CTBM— Fragment B CTBfot— Fragment B 



5 J. Epoxides as the linkage functional groups and the corresponding blocked linkage 
functional groups. 

(a). The linkage functional group is the epoxide group. These CTBFs may be 
available commercially, or may be prepared by a variety of known methods to those 
practiced in the art. 

1 0 Epoxide CTBFs are represented with four parts (A through D) of the CTBF attached to 
the epoxide group as shown below. Each fragment may be H, or functionality 
whereby a carbon atom is directly attached to the epoxide functionality. 



CTBF— Part A 



CTBF— Part B 




CTBF — Part C 



CTBF — Part D 



15 

(b). Alternatively, the library may be assembled as 1,2-amino alcohol BLFGs. 
These CTBFs may be prepared by coupling epoxide LFGs with primary or secondary 
amines employing known methods to those practiced in the art (March, et al., supra, p. 
416).. R 8 or R 9 may be H, or a straight chain or branched alkyl group of length 1 to 

2 0 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
or R 9 may also be appended with up to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono, sulphono). R 8 or R 9 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 

25 acylamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF — Part A 




CTBF — Part C | | CTBF— Part A f y Y A CTBF— Part C | 



CTBF-Part B K X ] ^ TPC _ D ^ „ 1 + NHR R " I CTBF-Part B 



CTBF— Part D I ^' Dl p r R 8 R 9 N ^ CTBF— Part D 




(c). Alternatively, the library may be assembled as, 2-hydroxy thioether 
30 BLFGs. These CTBFs may be prepared by coupling epoxide LFGs with thiols 
employing known methods to those practiced in the art (Wardell, in Patai The 
Chemistry of the Thiol Group, pt. 1; Wiley, New York, 1974, pp. 246-251). R 8 may 
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be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from I to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be 
appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 
R 8 may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). The 2-hydroxy thioether BLFGs may also be readily oxidized 
to more water soluble 2-hydroxy sulfoxide or sulfone BLFGs (March, et al., supra, pp. 
1201-1203). 

I CTBF — Part A k /\ J CTBF— Part C I I CTBF— Part A k ? H J CTBF— Part C 
1 . hsr8 '^J ^\ 




CTBF-Part B K Nj » 1 CTBF — Part B 



CTBF-Part D I ^'"^ ™" ° V R»s ^ CTBF-Part D 



(d). Alternatively, the library may be assembled as,2-hydroxy ether BLFGs. 
These CTBFs may be prepared by coupling epoxide LFGs with alcohols employing 
known methods to those practiced in the art(March, et al., supra, p. 391). R 8 may be 
H, or a straight chain or branched alkyl group of length 1 to 10, which may incorporate 
from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be appended 
with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also 
be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono). 



| CTBF— Part A K y ^/y^H CTBF — Part C | ^ | CTBF-Part A kj" ^\ CTBF — Part C | 



CTBF Part B K C^F-Pa T^H ™ I CTBF-Part S CTBF — Part D | 



4. Carboxylic acids as the linkage functional groups and the corresponding blocked 
linkage functional groups 

(a). The linkage functional group is the carboxylic acid. These CTBFs may be 
available commercially, or may be prepared by a variety of known methods to those 
practiced in the art. 

Carboxylic acid CTBFs are represented as follows. 



- 48 - 



BNSDOCID: <WO 9949314A1 I > 



WO 99/49314 



PCT/US99/06734 



CTBF — COOH 



10 



15 



(b). Alternatively, the library may be assembled as amide BLFGs. These 
CTBFs may be prepared by coupling carboxylic acid LFGs (X = OH), or derivatives 
of carboxylic acid LFGs (X = OR, SR, halide) with primary or secondary amines. 
5 Numerous known methods are available for coupling carboxylic acids and carboxylic 
acid derivatives with amines to those practiced in the art [e.g., March, et al., supra, pp. 
417-425; and Fields et al. Int. J. Peptide Protein Res. 35:181-187 (1990)]. R 8 and R 9 
may be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 and R 9 may 
also be appended with up to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, TV-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 and R 9 may also be an aryl or heteroaryl group that is optionally 
substituted (alkyl, aryl, halide, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). 

^ + HN — R 8 >- I CTBF I — ^ 



25 



M) 



CTBF v . ... ( 



R 9 



NR 8 R 



(8 D 9 



(c). Alternatively, amine BLFGs could be prepared by reduction of amide 
BLFGs prepared as described above in step 4b from the corresponding carboxylic acid 
20 LFGs. A number of reducing agents could be employed that are known to those 
practiced in the art (e.g., March, et al., supra, pp. 1212-1213). R 8 or R 9 may be H, or 
a straight chain or branched alkyl group of length 1 to 10, which may incorporate from 
1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 or R 9 may also be appended 
with up to five R 10 groups (R*° is alkyl, ary], heteroaryl, carboxy ester, carboxamide, 
amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 or R 9 
may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). 



K° Reducing Agent , * ^R 8 ** 
CTBF — CH2 



CTBF x - 
NF^R 



(d). Alternatively, the library may be assembled as ester BLFGs. 

(i). These CTBFs may be prepared by coupling carboxylic acid LFGs 
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(X = OH), or derivatives of carboxylic acid LFGs (X = OR, SR, halide) with alcohols. 
Carboxylic acids ^hd carboxylic acid derivatives may be coupled with alcohols 
employing numerous known methods to those practiced in the art [e.g., (March, et al., 
supra, pp. 392-398) and (e.g., Greene, et al., supra, pp. 227-228), and (Hughes et 
5 ah(Paquette, Series Editor in Chief), Organic Reactions ; John Wiley & Sons, New 
York, 1992, vol. 42, pp. 343-347)]. R 8 may be H, or a straight chain or branched alkyl 
group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) 
within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is alkyl, 
aryl, heteroaryl, carboxy ester, carboxamide, amino, 7V-acylamino, alkoxy, hydroxy, 
mercapto, phosphono, sulphono).* R 8 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, halide, heteroaryl, carboxy ester, carboxamide, 
amino, //-acylamino, alkoxy, hydroxy, mercapto, phosphono). 



15 



20 



30 



CTBF 



O O 

—\ + HO-R 8 I CTBF \—\ 



OFT 



(ii). These CTBFs may be prepared by reacting carboxylic acid LFGs 
with an aryl, heteroaryl or alkyl group substituted with a leaving group X, where X 
may be a halide or a sulfonate group (OS0 2 R where R is substituted or unsubstituted 
alkyl or aryl, e.g. CH 3 , CF 5 , phenyl-CH 3 and phenyl-N0 2 ). The halide could be 
attached to aromatic or heteroaromatic functionality on R 8 , or it could be attached to 
an aliphatic group on R 8 . When X is substituted upon aromatic and heteroaromatic 
functionality, an S N Ar reaction or a palladium-mediated, copper-mediated or related 
transtion metal mediated coupling reaction would be performed. Where X is 
substituted upon alkyl functionality, an S N 2 or SnI reaction would be performed. 
25 Methods for these transformations are known to those practiced in the art [e.g., 
(March, et al., supra y pp. 398-399) and (e.g., Greene, et al., supra, pp. 228-229)]. R 8 
may be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be 
appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, jV-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 
R 8 may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, /V-acylamino, alkoxy, hydroxy, 
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mercapto, phosphono). 



10 



15 



20 



10 



CTBF 



O P 

— // + x-R 8 I CTBF I — \ 

OH 1 1 



OR* 



(e). Alternatively the library may be assembled as thioester BLFGs. 

(i). These CTBFs may be prepared by condensation of carboxylic acid 
LFGs (X = OH), or carboxylic acid derivatives (X = OR, SR, halide) and thiols. 
Carboxylic acids and carboxylic acid derivatives may be coupled with thiols 
employing known methods to those practiced in the art (e.g., March, et al., supra, p. 
409). R 8 may be H, or a straight chain or branched alkyl group of length 1 to 10, 
which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 may also be an aryl or heteroaryl group that is optionally substituted 
(alkyl, aryl, halide, heteroaryl, carboxy ester, carboxamide, amino, TV-acylamino, 
alkoxy, hydroxy, mercapto, phosphono). 

" — ^ + HS-R 8 >- | CTBF | — C 



CTBF 



SR 



5. Sulfonic acids as the linkage functional groups and the corresponding blocked 
linkage functional groups 

(a). The linkage functional group is the sulfonic acid. These CTBFs may be 
available commercially, or may be prepared by a variety of known methods to those 
2$ practiced in the art. 

Sulfonic acid CTBFs are represented as follows. 



CTBF f— SQ 3 H 



(b). Alternatively, the library may be assembled as sulfonamide BLFGs. 
These CTBFs may be prepared by reacting sulfonic acid LFGs (X = OH) or 
derivatives of sulfonic acids (X = halide, alkoxyl, mercaptyl) with amines (e.g., March, 
et al., supra, p. 499, Greene, et al. supra, pp. 379-385). R 8 and R 9 may be H, or a 
straight chain or branched alkyl group of length 1 to 10, which may incorporate from 1 
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to 10 heteroatoms (N, O, P, S) within the chain. R 8 and R 9 may also be appended 
with up to five tT 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino, 7V-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 and R 9 
may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
5 heteroaryl, carboxy ester, carboxamide, amino Af-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). 



10 



CTBF — + HN— R 8 



X ' 9 



CTBF 



^N-R 8 



R 9 i 9 



R a 

6. The hydroxyl group as the linkage functional group and the corresponding blocked 
linkage functional groups 

(a). The linkage functional group is the hydroxyl group. 

These CTBFs may be available commercially, or may be prepared by a variety 
of known methods to those practiced in the art. 
j 5 Alcohol CTBFs are represented as follows. 

~~ 1 OH 



CTBF 



(b). Alternatively the library may be assembled as ether BLFGs. 

(i). The ether may be prepared by reaction of an alcohol LFG with an 
aryl, heteroaryl or alkyl group substituted with a leaving group X, where X may be a 
halide or a sulfonate group (OS0 2 R where R is substituted or unsubstituted alkyl or 
aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). The halide could be attached to 
aromatic or heteroaromatic functionality on R 8 , or it could be attached to an aliphatic 
group on R 8 . When X is substituted upon aromatic and heteroaromatic functionality, 
an SfsjAr reaction or a palladium-mediated, copper-mediated or related transtion metal 

25 

mediated coupling reaction would be performed [(e.g., March, et al., supra, pp. 654- 
655) and Hartwig et al., supra y pp. 8005-8008). Where X is substituted upon alkyl 
functionality, an Sjsj2 or SnJ reaction would be performed. R 8 may be H, or a straight 
chain or branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
30 heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five 
R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, 7V- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl 
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or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 

-OH + X— R 8 



CTBF 



CTBF — O-R 8 



10 



15 



(ii). The ether may be prepared by the Mitsunobu reaction between 
5 alcohol LFGs and another alcohol where the second alcohol is acidic (pKa < 12), for 
example, phenols and oximes (Hughes et aI.(Paquette, Series Editor in Chief), Organic 
Reactions, John Wiley & Sons, New York, 1992, vol. 42, pp. 335-636). R 8 may be H, 
or a straight chain or branched alkyl group of length 1 to 10, which may incorporate 
from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with 
up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, 
/V-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an 
aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, 7V-acylamino, alkoxy, hydroxy, mercapto, phosphono). 
The key requirement on R 8 is that the alcohol has a pKa < 12. 

s 



25 



30 



CTBF —OH + HO— 



CTBF — O-R 



(iii). The ether may also be prepared by the Mitsunobu reaction 
between alcohol LFGs and another alcohol, where the alcohol LFGs are acidic (pK a < 
12), for example, phenols or oximes (Hughes et aI.(Paquette, Series Editor in Chief), 
20 Organic Reactions, John Wiley & Sons, New York, 1992, vol. 42, pp. 335-636). R 8 
may be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be 
appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, Af-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 
The key requirement on the pharmacophore substituted alcohol is that it has a pK a < 
12. 

' ' 8 



CTBF 



-OH + HO — R 



CTBF — O-R 



(c). Alternatively the library may be assembled as ester (Z = O) or thioester (Z 
= S) BLFGs. Alcohol LFGs may be coupled with carboxylic acids (Z = O, X = OH), 
carboxylic acid derivatives (Z = O, X = OR, SR, halide), or the corresponding thio- 
substituled derivatives (Z = S, X = OH, OR, SR halide). Carboxylic acids and 
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carboxylic acid derivatives may be coupled with alcohols employing a variety of 
known methods to those practiced in the art [e.g., (March, et al., supra, pp. 392-398) 
and (e.g., Greene, et al., supra, pp. 227-228), and (Hughes et al.(Paquette, Series 
Editor in Chief), Organic Reactions, John Wiley & Sons, New York, 1992, vol. 42, 
pp. 343-347)]. R 8 may be H, or a straight chain or branched alkyl group of length 1 to 
10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
may also be appended with up to five R 9 groups (R 9 is halide, alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino, /V-acylamino, alkoxy, hydroxy, mercapto, 
phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is optionally 
substituted (halide, alkyl, aryl, halide, heteroaryl, carboxy ester, carboxamide, amino, 
7V-acylamino, alkoxy, hydroxy, mercapto, phosphono). 



A. r u^~ R 

+ * R w CTBF — O 



CTBF —OH 

z = o,s 

(d). Alternatively the library may be assembled from thioether BLFGs. The 
thioethers may be prepared by the Mitsunobu reaction between alcohol LFGs and a 
thiol (Hughes et al.(Paquette, Series Editor in Chief), Organic Reactions, John Wiley 
& Sons, New York, 1992, vol. 42, pp. 365-366, 381-382). R 8 may be H, or a straight 
2o chain or branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five 
R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl 
or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, 
25 carboxamide, amino, /V-acylamino, alkoxy, hydroxy, mercapto, phosphono). The 
thioether BLFGs may also be readily oxidized to more water soluble sulfoxide or 
sulfone BLFGs. 



CTBF —OH + HS — R 8 



CTBF — S— R 1 



(e). Alternatively the library may be assembled as carbamate (Z = O) or 
thiocarbamate (Z = S) BLFGs. 

(i). These CTBFs may be prepared by condensation of alcohol LFGs 
and isocyanates or isothiocyanates. The direct coupling of isocyanales and 
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isothiocyanates with alcohols is straightforward and obvious to those practiced in the 
art (March, et al., supra, pp. 891-892). R 8 may be H, or a straight chain or branched 
alkyl group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, 
P, S) within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is 
alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, TV-acylamino, alkoxy, 
hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group 
that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino Af-acylamino, alkoxy, hydroxy, mercapto, phosphono). 

Z 



CTBF —OH + £=C=N— R 8 ^ CTBF | ^ c ^.^ r8 



X> 1ST 
H 

Z = O, S Z = O f S 

(ii). These CTBFs may be prepared by condensation of alcohol LFGs 
and carbamates, thiocarbamates, or related derivatives where X is an alkoxy group, a 
mercaptyl group, or a halide. R 8 may be H, or a straight chain or branched alkyl 
group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) 
within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is alkyl, 
aryl, heteroaryl, carboxy ester, carboxamide, amino, W-acylamino, alkoxy, hydroxy, 
mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF 



d9 I 



-OH 

" R* 



^R 



Z = O.S 2 = O.S 

25 (iii). These CTBFs may be prepared in a two step process (e.g., March, 

et al., supra, p. 418; and Greene, et al., supra, pp. 315-348). In the first step, the 
alcohol LFGs are converted to carbonate (Z = O) or thiocarbonate (Z = S) or related 
derivatives where X and Y are alkoxy groups, mercaptyl groups, halides, or other 
suitable leaving groups. In the second step an amine is added to displace the leaving 
30 group Y. R 8 and R 9 may be H, or a straight chain or branched alkyl groups of length 
1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. 
R 8 and R 9 may also be appended with up to five R 10 groups (R 10 is alkyl, aryl, 
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heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono, sulphono). R» and R 9 may also be an aryl or heteroaryl group 
that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino iV-acylamino, alkoxy, hydroxy, mercapto, phosphono). 

I. 



0) 



CTBF 



-OH 



+ A 



(2) 



CTBF 



z = o,s 



Y 

z = o, S 



R<k 
+ fjJH 



CTBF 



CTBF 



z 

A 



R 9 



" Z = O.S 

(f). Alternatively the library may be assembled with thioester BLFGs. The 
thioesters may be prepared by the Mitsunobu reaction between alcohol LFGs and a 
thiol acid (Hughes et al.(Paquette, Series Editor in Chief), Organic Reactions, John 
Wiley & Sons, New York, 1992, vol. 42, pp. 343-347). R« may be H, or a straight 
chain or branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R« may also be appended with up to five 
R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl 
or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF —OH + HS 



CTBF 



(g). Alternatively the library may be assembled with amide, thioamide, urea, 
thiourea, sulfonamide, carbamate, thiocarbamate (Z = SO2, CO, CS, CO2, COS, CSO, 
CONR", CSNRH.) BLFGs. Substitution of the hydroxyl LFGs are accomplished 
using the Mitsunobu reaction [ e.g., (Hughes et al.(Paquette, Series Editor in Chief), 
Organic Reactions, John Wiley & Sons, New York, 1992, vol. 42, pp. 335-636) and 
(Fukuyama, et al., Tetrahedron Letters, 38: 5831-5834 (1997) and references therein)]. 
An aliphatic hydroxyl group is prefered. R» and R 9 may be H, or a straight chain or 
branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R« and R 9 may also be appended with up 
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to five R 10 groups (R™ is alkyl, aryl, halide, heteroaryl, carboxy ester, carboxamide, 
amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). . R 8 and R 9 
may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
halide, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). 

2^R 8 



CTBF }-OH -f^N^R 8 



CTBF — N 



\ 



Z = S0 2 , CO, CS, C0 2 , COS, CSO, CONR 10 CSNR 10 
(h). Alternatively, the library man be assembled with amine BLFGs. The 
amine BLFGs will be prepared from the BLFGs described in 6g. The acyl or sulfonyl 
functionality may be removed by methods known to those practiced in the art (such as, 
acidic or basic hydrolysis, or dissolving metal reactions for sulfonamides Greene et al., 
supra, 349-357 and 379-385). Milder conditions may be applied to more specialized 
groups, e.g., trifluoroacetamides may be cleaved by mild basic hydrolysis (Greene et 
al., supra, 353-354), and nitrosubstituted benzenesulfonamides may be cleaved by 
thiolate addition) to provide secondary amines (Fukuyama, et ah, Tetrahedron Letters, 
38: 5831-5834 (1997) and references therein). Alternatively for some derivatives, 
reduction will provide tertiary amines (e.g., March, et ah, supra, pp. 1212-1213). 



CTBF — N 



Z-R J 



CTBF — N 



\ 



CTBF — N 



,Z-R B 



or 



reduction 



\ 



CTBF 



-N 



25 z = S0 2 , CO, CS, C0 2 , COS, CSO, CONR 10 , CSNR 10 

7. The thiol group as .the linkage functional group and the corresponding blocked 
linkage functional groups 

(a). The linkage functional group is the thiol group. These CTBFs may be 
available commercially, or may be prepared by a variety of known methods to those 
10 practiced in the art. 

Thiol CTBFs are represented as follows. 



CTBM — SH 
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(b). Alternatively the library may be assembled as thioether BLFGs. 

(fT The thioether may be prepared by reaction of a thiol LFG with an 
aryl, heteroaryl or alkyl group substituted with a leaving group X, where X may be a 
halide or a sulfonate group (OS0 2 R where R is substituted or unsubstituted alkyl or 
5 aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). The halide could be attached to 
aromatic or hetero aromatic functionality on R 8 , or it could be attached to an aliphatic 
group on R 8 . When X is substituted upon aromatic and heteroaromatic functionality, 
an SNAr reaction or a palladium-mediated, copper-mediated or related transtion metal 
mediated coupling reaction would be performed. Where X is substituted upon alkyl 
10 functionality, an Sn2 or SnI reaction would be performed. These methods are known 
to those practiced in the art [(March et al., supra, pp. 407-409) and (Peach in Patai The 
Chemistry of the Thiol Group, pt. 1, John Wiley & Sons, New York, 1974, pp. 721- 
735)]. R 8 may be H, or a straight chain or branched alkyl group of length 1 to 10, 
which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, 7V-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 may also be an aryl or heteroaryl group that is optionally substituted 
(alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, TV-acyl amino, alkoxy, 
20 hydroxy, mercapto, phosphono). The thioether BLFGs may also b oxidized to more 
water soluble sulfoxide and sulfone BLFGs (March, et al., supra, pp. 1201-1203). 



CTBF — SH + X — R 8 CTBF — S-R 8 



25 



(ii). The thioether may also be prepared by the Mitsunobu reaction 
between alcohol LFGs and a thiol (e.g., (Hughes et al. (Paquette, Series Editor in 
Chief), Organic Reactions, John Wiley & Sons, New York, 1992, vol. 42, pp. 335- 
636).. R 8 may be H, or a straight chain or branched alkyl group of length 1 to 10, 
which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy 
3() ester, carboxamide, amino, W-acylamino, alkoxy, mercapto, phosphono, sulphono). 
The thioether BLFGs may also b oxidized to more water soluble sulfoxide and 
sulfone BLFGs (March, et al., supra, pp. 1201-1203). 
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-SH + HO— R 



8 



CTBF — S— R fi 



(c). Alternatively the library may be assembled with BLFGs whereby the thiol 
LFGs are acylated (March et al. f supra, p. 409). Thiol LFGs may be coupled with 
carboxylic acids (Z = O, X = H), carboxylic acid derivatives (Z = O, X = OR, SR, 
5 halide), or the corresponding thio-substituted derivatives (Z = S, X = H, OR, SR 
halide). Carboxylic acids and carboxylic acid derivatives may be coupled with thiols 
employing known methods to those practiced in the art. R 8 may be H, or a straight 
chain or branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R» may also be appended with up to five 
R 9 groups (R 9 is halide, alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R8 may also be an aryl 
or heteroaryl group that is optionally substituted (halide, alkyl, aryl, halide, heteroaryl, 
carboxy ester, carboxamide, amino, /V-acylamino, alkoxy, hydroxy, mercapto, 
j 5 phosphono). 



+ X^R 8 >- CTBF \—S 



CTBF \ — SH 

Z = O.S 

(d). Alternatively the library may be assembled as thiocarbamate (Z = O) or 

20 thiocarbamate (Z = S) BLFGs. 

(i). These CTBFs may be prepared by condensation of thiol LFGs and 
isocyanates or isothiocyanates. The direct coupling of isocyanates and isothiocyanates 
with thiols is straigh forward and obvious to those practiced in the art (Greene et al. 
supra, p. 301). R» may be H, or a straight chain or branched alkyl group of length 1 to 

25 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
may also be appended with up to five R*> groups (R* is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, W-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R8 may also be an aryl or heteroaryl group that is optionally substituted 
(alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino W-acylamino, alkoxy, 
hydroxy, mercapto, phosphono). 
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CTBF 



8 



CTBF 



H 



Z = O, S 2 = O t S 

(ii). These CTBFs may be prepared by condensation of thiol LFGs and 
carbamates, thiocarbamates, or related derivatives where X is an alkoxy group, a 
mercaptyl group, or a halide. R 8 may be H, or a straight chain or branched alkyl 
group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) 
within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is alkyl, 
aryl, heteroaryl, carboxy ester, carboxamide, amino, Af-acylamino, alkoxy, hydroxy, 
mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). 



~ SH + X^^ R8 | CTBF k JL^ R 



CTBF 

R R 9 

z = o,s z = o,s 

(iii). These CTBFs may be prepared in a two step process. In the first 
step, the thiol LFGs are converted to carbonate (Z = O) or thiocarbonate (Z = S) or 
related derivatives where X and Y are alkoxy groups, mercaptyl groups, halides, or 
other suitable leaving groups (Greene et al. supra, pp. 299-301). In the second step an 
amine is added to displace the leaving group Y. R 8 and R 9 may be H, or a straight 
chain or branched alkyl groups of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 and R 9 may also be appended with up 
to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, 
TV-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 and R 9 may also 
be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino Af-acylamino, alkoxy, hydroxy, mercapto, 
phosphono). 
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(1) 



CTBF 



-SH 



z = o. s 



CTBF 



X 



(2) 



CTBF 



z = o,s 



y + R ^rsiH 



CTBF 



2 = O.S 

(e). Alternatively the library may be assembled as disulfide BLFGs. The 
disulfides may be prepared by reacting the thiol LFGs with thiols (X = H) or activated 
thiols (X = mercaptyl, halide, sulfonyl) employing known methods to those practiced 
in the art (e.g., Greene et al., supra, pp. 302-303). R 8 may be H, or a straight chain or 
branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five 
R 9 groups (R 9 is halide, alkyl, aryl, heteroaryl, carboxy ester, carboxamide. amino. V- 
acylarnino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R s may also be an aryl 
or heteroaryl group that is optionally substituted (halide, alkyl, aryl. halide, heteroar>.. 
carboxy ester, carboxamide, amino, 7V-acylamino, alkoxy, hydroxy, mercapie. 
phosphono). 

' CTBF I— S — S-R 8 



CTBF — SH + XS-R B 



8. The isocyanate or isothiocyanate group as the linkage functional group ana ii:<r 
corresponding blocked linkage functional groups 

(a). The linkage functional group is the isocyanate (Z = O) or isothi<vyina;e 
(Z - S) group. These CTBFs may be available commercially, or may be prepared by a 
variety of known methods to those practiced in the art. 

Isocyanate (Z = O) or isothiocyanate (Z .= S) CTBFs are represented as follow 5. 

-N=C=Z 



CTBF 



z = o, s 

(b). Alternatively the library may be assembled with urea or thiourea BLFGs 
by reaction of the isocyanate (Z = O) or isothiocyanate (Z = S) LFGs with amines 
(March et al., supra, p. 903). The direct coupling of isocyanates with amines :s 
straighforward and known to those practiced in the art. R 8 and R 9 may be H. or ; 
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straight chain or branched alkyl groups of length 1 to 10, which may incorporate from 
1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 and R 9 may also be appended 
with up to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino, 7V-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 and R 9 
may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino /V-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). 

r ^M--rl 



CTBF 



-N=C=2 + HN-R 8 I CTBF - N 

R 



R 9 H ^9 



z = o, S „ 

(c). Alternatively the library may be assembled as BLFGs by reaction of the 
isocyanate (Z = O) or isothiocyanate (Z = S) LFGs with thiols (Green et al., supra, p. 
301). R 8 may be H, or a straight chain or branched alkyl groups of length 1 to 10, 
which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, Af-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 may also be an aryl or heteroaryl group that is optionally substituted 
(alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino 7V-acylamino, alkoxy, 
hydroxy, mercapto, phosphono). 

X R* 

H 



CTBF — N=C=Z + HS-R 8 



CTBF 



z = o, S 

(d). Alternatively the library may be assembled as BLFGs by reaction of the 
isocyanate (Z = O) or isothiocyanate (Z = S) LFGs with alcohols (March et ah, supra, 
pp. 891-892). R 8 may be H, or a straight chain or branched alkyl groups of length 1 to 
10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, 7V-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 may also be an aryl or heteroaryl group that is optionally substituted 
(alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino A'-acylamino, alkoxy, 
hydroxy, mercapto, phosphono). 
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CTBF — N=C=Z -f HO— R CTBF — N 

H ~ 

z = o, S 

9. TVie halide or related sulfonate group as the linkage functional group and the 
5 corresponding blocked linkage functional groups 

(a). The linkage functional group is the halide leaving group or a similarly 
reactive sulfonate (X= OS0 2 R where R is substituted or unsubstituted alkyl or aryl, 
e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ) leaving group. These CTBFs may be 
available commercially, or may be prepared by a variety of known methods to those 
I o practiced in the art. 

The halide and sulfonate (X= OS0 2 R where R is substituted or unsubstituted 
alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ) CTBFs may be represented 
as follows. 



20 



25 



CTBF — X 



(b). Alternatively the library may be assembled with amine BLFGs. 

(i). The amine BLFGs may be prepared by substitution of the leaving 
group (X = halide or OS0 2 R where R is substituted or unsubstituted alkyl or aryl, e.g. 
CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ) with amines The leaving group may be 
attached to aromatic or heteroaromatic functionality, alkenyl functionality, or it could 
be attached to aliphatic functionality. A number of methods are known to those 
practiced in the art [e.g, March, et al. supra, pp. 656-657 and 411-413; Wagaw et al., 
7. Am. Chem. Soc, 119: 8451-8458 (1997) and references therein; and Ahman et al. 
Tetrahedron Letters, 38: 6363-6366 (1997)]. When X is substituted upon aromatic 
and heteroaromatic functionality, an SjsjAr reaction may be performed. If the leaving 
group X is substituted upon aromatic, heteroaromatic or alkenyl functionality, a 
palladium-mediated (or related transition metal mediated) amine coupling reaction 
may be performed . Where X is substituted upon alkyl functionality, an S^2 or Sjsjl 
reaction may be performed. R 8 or R 9 may be H, or a straight chain or branched alkyl 
30 group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) 
within the chain. R 8 or R 9 may also be appended with up to five R 10 groups (R 10 is 
alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, 
hydroxy, mercapto, phosphono, sulphono). R 8 or R 9 may also be an aryl or heteroaryl 
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group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF 



NHR 8 R 9 



CTBF — NR 8 R 9 



^ (ii). The amine BLFGs may also be prepared by a two step process. In 

the first step, substitution of the leaving group X with an amide, thioamide, urea, 
thiourea sulfonamide, carbamate, thiocarbamate (Z = S0 2) CO, CS, C0 2 , COS CSO 
CONRH, CSNRH) is performed (see 0 - 0 below). X may be a halide or OSO jR 
where R is substituted or unsubstituted alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH, and 
jq P henyl-N0 2 . The leaving group may be attached to aromatic or heteroaromatic 
functionality, alkenyl functionality, or it may be attached to aliphatic functionality. . 
When X is substituted upon aromatic or heteroaromatic functionality, an S N Ar 
reaction may be performed. When X is substituted upon aromatic, heteroaromatic, 
allylic or alkenyl a palladium-mediated (or related transtion metal mediated) amine 
1 5 C ° Upling reaCti ° n may be P erf °™ed. Where X is substituted upon alkyl functionality, 
an S N 2 or S N 1 reaction may be performed. Methods are known to those practiced in 
the art (March et al., supra, pp.425-427). In the second step, the acyl or sulfonyl 
functionality may be removed by methods known to those practiced in the art (such as, 
acidic or basic hydrolysis, or dissolving metal reactions for sulfonamides (Greene et 
2Q al., supra, 349-357 and 379-385). Milder conditions may be applied to more 
specialized groups, e.g., trifluoroacetamides may be cleaved by mild basic hydrolysis 
(Greene et al., supra, 353-354), and nitrosubstituted benzenesulfonamides may be 
cleaved by thiolate addition) to provide secondary amines (Fukuyama, et al 
Tetrahedron Leuers, 38: 5831-5834 (1997) and references therein). Alternatively for 
25 some derivatives, reduction will provide tertiary amines (e.g., March, et al., supra, pp. 
1212-1213). R8 to RH may be H, or a straight chain or branched alkyl group of length 
1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. 
R8 to RH may also be appended wjth up tQ fiye Rl2 grQups ^ ^ 

heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy 
™ mercapto, phosphono, sulphono). R« to R l, may also be an ary , or ^ 
that is optionally substituted (alkyl, aryl, halide, heteroaryl, carboxy ester 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 
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CTBF —X -f N R 



CTBF 



R 9 R 9 

Z = S0 2t CO. CS, C0 2f COS, CSO, CONR 10 , CSNR 10 



(2) 



CTBF 



]— f/ I CTBF f— N 

R 9 V 



J_^ R8 auction , j-^p-j.^ 



CTBF 



(c). Alternatively the library may be assembled with ether BLFGs. The ether 
BLFGs may be prepared by displacement of leaving group LFGs with an alcohol. X 
may be a halide or a sulfonate group (OS0 2 R where R is substituted or unsubstituted 
alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyI-N0 2 ). The halide could be 
attached to aromatic or heteroaromatic functionality, alkenyl functionality, or it could 
be attached to an aliphatic functionality. When X is substituted upon aromatic or 
heteroaromatic functionality an S^Ar reaction may be performed. When X is 
substituted on aromatic, heteroaromatic, allylic, or alkenyl functionality a palladium- 
mediated, Cu mediated, or related transtion metal mediated coupling reaction may be 
performed performed [(e.g., March, et al., supra, pp. 654-655) and Hartwig et al. 
Tetrahedron Lett., 38: pp. 8005-8008). Where X is substituted upon alkyl 
20 functionality, an Sn2 or S^l reaction may be performed (e.g., March, et al., supra, pp. 

386-387 and 388-389). R 8 may be H, or a straight chain or branched alkyl group of 
length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the 
chain. R 8 may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). 

X + HO — R 8 » 



CTBF 



CTBF — O-R 



30 (d). Alternatively the library may be assembled with thioether BLFGs. The 

thioether BLFGs may be prepared by displacement of leaving group LFGs with an 
thiol. X may be a halide or a sulfonate group (OS0 2 R where R is substituted or 
unsubstituted alkyl or aryl, e.g. CH 3 , CF 3 , phenyI-CH 3 and phenyl-NO ? ). The halide 
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could be attached to aromatic or heteroaromatic functionality, alkenyl functionality, or 
it could be attached to an aliphatic functionality. When X is substituted upon aromatic 
or heteroaromatic functionality an SjsjAr reaction may be performed. When X is 
substituted on aromatic, heteroaromatic, allylic, or alkenyl functionality a palladium- 
mediated, Cu mediated, or related transtion metal mediated coupling reaction may be 
performed. Where X is substituted upon alkyl functionality, an Sn2 or S^l reaction 
may be performed. These methods are known to those practiced in the art [(March et 
al., supra, 407-409) and (Peach in Patai The Chemistry of the Thiol Group, pt. 1, John 
Wiley & Sons, New York, 1974, pp. 721-735)]. R 8 may be H, or a straight chain or 
branched alkyl group of leftgth 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five 
R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl 
or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). The 
thioether may also be oxidized to the more water soluble sulfoxide or sulfone BLFGs 
(March et al., supra, pp. 1201-1203). 

X + HS— R 8 I CTBF f-S-R 8 



CTBF 



(e). Alternatively the library may be assembled with ester BLFGs. The ester 
BLFGs may be prepared by displacement of leaving group LFGs with a carboxylate. 
X may be a halide or a sulfonate group (OS0 2 R where R is substituted or 
unsubstituted alkyl or aryl, e.g. CH 3 , CF 3 , phenyI-CH 3 and phenyl-N0 2 ). The halide 
could be attached to aromatic or heteroaromatic functionality, alkenyl functionality, or 
it could be attached to an aliphatic functionality. When X is substituted upon aromatic 
or heteroaromatic functionality an SnAr reaction may be performed. When X is 
substituted on aromatic, heteroaromatic, allylic, or alkenyl functionality a palladium- 
mediated, Cu mediated, or related transtion metal mediated coupling reaction may be 
performed. Where X is substituted upon alkyl functionality, an S^2 or Sjsjl reaction 
may be performed. . Methods for these transformations are known to those practiced in 
the art [e.g., (March, et al., supra, pp. 398-399) and (e.g., Greene, et al., supra, pp. 
228-229)]. R 8 may be H, or a straight chain or branched alkyl group of length 1 to 
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10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 may also be an aryl or heteroaryl group that is optionally substituted 
(alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, 
hydroxy, mercapto, phosphono). 



CTBF — X + M + 'O 



CTBF 



(0- Alternatively the library may be assembled with thiolester BLFGs. The 
ester BLFGs may be prepared by displacement of leaving group LFGs with a 
thiolacid. X may be a halide or a sulfonate group (OS0 2 R where R is substituted or 
unsubstituted alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). The halide 
could be attached to aromatic or heteroaromatic functionality, alkenyl functionality, or 
it could be attached to an aliphatic functionality. When X is substituted upon aromatic 
or heteroaromatic functionality an S^Ar reaction may be performed. When X is 
substituted on aromatic, heteroaromatic, allylic, or alkenyl functionality a palladium- 
mediated, Cu mediated, or related trans tion metal mediated coupling reaction may be 
performed. Where X is substituted upon alkyl functionality, an Sn2 or S^l reaction 
20 may be performed. R 8 may be H, or a straight chain or branched alkyl group of length 
1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. 
R 8 may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is optionally 
25 substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, 
alkoxy, hydroxy, mercapto, phosphono). 



_CTBFj-X + SH^R 8 | CTBF M 

(g). Alternatively the library may be assembled with urea or thiourea BLFGs. 
These BLFGs may be prepared by displacement of leaving group LFGs with a primary 
or secondary ureas (Z = O) or thioureas (Z = S). X may be a halide or a sulfonate 
group (OS0 2 R where R is substituted or unsubstituted alkyl or aryl, e.g. CH,, CF,, 
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pheny!-CF 3 and phenyI-N0 2 ). The halide could be attached to aromatic or 
heteroaromatic- functionality, alkenyl functionality, or it could be attached to an 
aliphatic functionality. When X is substituted upon aromatic or heteroaromatic 
functionality an S^Ar reaction may be performed. When X is substituted on aromatic, 
heteroaromatic, allylic or alkenyl functionality a palladium-mediated, Cu mediated, or 
related transtion metal mediated coupling reaction may be performed. Where X is 
substituted upon alkyl functionality, an Sn2 or Sjsil reaction may be performed. R 8 to 
R 10 may be H, or a straight chain or branched alkyl group of length 1 to 10, which 
may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. . R 8 to R 10 
may also be appended with*up to five R 11 groups (R 11 is alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono, sulphono). . R 8 to R 10 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF 



^N^NF^R 9 - r~CTBF I — 



CTBF 



NR®R 



9 



\ 



R 10 R10 

Z = o,s 

(h). Alternatively the library may be assembled with sulfonamide BLFGs. 
The sulfonamide BLFGs may be prepared by displacement of leaving group LFGs 
with a sulfonamide. X may be a halide or a sulfonate group (OS0 2 R where R is 
substituted or unsubstituted alkyl or aryl, e.g. CH 3 , CF 3 , pheny!-CH 3 and phenyl-N0 2 ). 
The halide could be attached to aromatic or heteroaromatic functionality, alkenyl 
functionality, or it could be attached to an aliphatic functionality. When X is 
substituted upon aromatic or heteroaromatic functionality an Sj^Ar reaction may be 
performed. When X is substituted on aromatic, heteroaromatic, allylic or alkenyl 
functionality a palladium-mediated, Cu mediated, or related transtion metal mediated 
coupling reaction may be performed. Where X is substituted upon alkyl functionality, 
an Sn2 or Sj^l reaction may be performed.. These methods are known to those 
practiced in the art [(March et al. f supra, pp.425-427) and (Fukuyama et al., 
Tetrahedron Letters, 38:5831-5834 (1997)]. R 8 may be a straight chain or branched 
alkyl group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, 
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P, S) within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is 
alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, 
hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group 
that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF 



K - A ° 8 I - I ^ 



N R 
R 9 



CTBF 



/S-R 8 



R 9 



(i). Alternatively the library may be assembled with carbamate, thiocarbamate, 
JO or related BLFGs. These BLFGs may be prepared by displacement of leaving group 
with the corresponding carbamate, thiocarbamate, or related BLFGs. X may be a 
halide or a sulfonate group (OS0 2 R where R is substituted or unsubstituted alkyl or 
aryl, e.g. CH*, CF 3 , phenyl-CH 3 and phenyl-N0 2 ). The halide could be attached to 
aromatic or heteroaromatic functionality, alkenyl functionality, or it could be attached 
15 to an aliphatic functionality. When X is substituted upon aromatic or heteroaromatic 
functionality an S^Ar reaction may be performed. When X is substituted on aromatic, 
heteroaromatic, allylic or alkenyl functionality a palladium-mediated, Cu mediated, or 
related transtion metal mediated coupling reaction may be performed. Where X is 
substituted upon alkyl functionality, an Sn2 or Sjsjl reaction may be performed. R 8 to 
20 R 9 may be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. . R 8 to R 9 may 
also be appended with up to five R 11 groups (R 10 is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). . R 8 to R 10 may also be an aryl or heteroaryl group that is optionally 
substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, 
alkoxy, hydroxy, mercapto, phosphono. 



CTBF 



CTBF 



R 9 

™ 2 = 0, S;Y = O.S 

(j). Alternatively the library may be assembled by BLFGs where the leaving 
group is replaced with carbon-based functionality. 
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(i). X may be a halide or a sulfonate group (OS0 2 R where R is 
substituted or tfrisubstituted alkyl or aryl, e.g. CH 3 , CF 3 , phenyI-CH 3 and phenyl-N0 2 ). 
The metal (M) may be any metal, but the prefered metals are BL , SnL , ZnL , ZrL , 
CuL n , SiL n , Ti L n , Al L n , and L n where L corresponds to metal ligands (e.g., halide, 
5 alkoxide, alkyl, aryl, alkenyl, heteroaryl, phosphine sulfide, amido) many of which are 
acceptable and are known to those practiced in the art. The halide could be attached to 
aromatic or heteroaromatic functionality, alkenyl functionality, or it could be attached 
to an aliphatic functionality. Addition can occur directly, or can be catalyzed by 
transition metals. Numerous methods are known to those practiced in the art [e.g. 
10 (Hegedus, L. S. (1994) in Transition Metals in the Synthesis of Complex Organic 
Molecules pp. 65-129, University Science Books, Mill Valley), (Knochel et ak 
Chemical Reviews 93:2117-2188 (1993)), (Miyaura et al.. Chemical Reviews 95: 
2457-2483 (1995)), and (Farina et al.(Paquette, Series Editor in Chief), Organic 
Reactions, John Wiley & Sons, New York, 1997, vol. 50, pp. 1-653)]. R 8 may be H, 
15 or a straight chain or branched alkyl group of length 1 to 10, which may incorporate 
from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with 
up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, 
N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an 
aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 

X + m— R 8 > 



20 



30 



CTBF 



CTBF 



(ii). X may be a halide or a sulfonate group (OS0 2 R where R is 
substituted or unsubstituted alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). 
25 The BLFGs may be prepared in a two step process. In the first step, an 
organometallic reagent is prepared where the metal (M) may be any metal, but the 
prefered metals are BL„ SnL t( , ZnL , ZrL , CuL n , SiL n , Ti L„, Al L n , and L n where L 
corresponds to metal ligands (e.g., halide, alkoxide, alkyl, aryl, alkenyl, heteroaryl, 
phosphine sulfide, amido) many of which are acceptable and are known to those 
practiced in the art. In the second step, carbon-carbon bond formation is performed to 
generate the BLFGs. Numerous methods for this two step process are known to those 
practiced in the art [e.g. (Hegedus, L. S. (1994) in Transition Metals in the Synthesis of 
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Complex Organic Molecules pp. 65-129, University Science Books, Mill Valley), 
(Knochel et al. Chemical Reviews 93:2117-2188 (1993)), (Miyaura et al.. Chemical 
Reviews 95: 2457-2483 (1995)), and (Farina et al.(Paquette, Series Editor in Chief), 
Organic Reactions, John Wiley & Sons, New York, 1997, vol. 50, pp. 1-653)]. R 8 
may be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be 
appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 
R 8 may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). 

(1) I CTBF J — X * I CTBF I — M 



15 



20 



25 



30 



(2) 



CTBF — M + 



R 8 X 



CTBF 



(iii). X may be a halide or a sulfonate group (OS0 2 R where R is 
substituted or unsubstituted alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyI-N0 2 ). 
The halide would preferably be attached to aromatic, heteroaromatic functionality, or 
alkenyl functionality. A palladium-mediated Heck reaction or related transtion metal 
mediated coupling reaction would be performed [e.g., (de Meijere, A. et al., Angew. 
Chem. Int. Ed. Engl, 33:2379-2411 (1994) and references therein) and Heck, 
Palladium Reagents in Organic Synthesis, Academic Press, London 1985)]. For the 
alkene, R 8 - R 9 may be straight chain or branched alkyl groups of length 1 to 10, which 
may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 - R 9 may 
also be appended with up to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 - R 9 may also be aryl or heteroaryl groups that are optionally substituted 
(alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, 
hydroxy, mercapto, phosphono). 



*10 



CTBF 



H- 



CTBF 




,10 
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CTBF —X + 
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R 8 




R 8 
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CTBF 
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CTBF 
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CTBF 



H 
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CTBF 
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(iv). X may be a halide or a sulfonate group (OS0 2 R where R is 
substituted or unsubstituted alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). 
The halide would preferably be attached to aromatic, heteroaromatic functionality, or 
alkenyl functionality. A copper-mediated, palladium-mediated reaction, or related 
transtion metal mediated coupling reaction would be performed. Numerous methods 
are known to those practiced in the art [e.g., (March et aL, supra, p. 481), 
(Sonagashira, K. in Comprehensive Organic Synthesis; Trost, B. M., Fleming, I., Eds.; 
Pergamon Press: New York, 1991, vol. 3 pp. 521-549), and (Rossi et aL, Org. Prep. 
Proced. Int. 27:129-160 (1995)]. For the alkyne, R 8 may be H, straight chain or 
branched alkyl groups of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five R 9 
groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, 
alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be a aryl or heteroaryl 
group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 

X + =^-R 8 ^ 



CTBF 



CTBF 



70. The alkenyl group as the linkage functional group and the corresponding blocked 
linkage functional groups 

(a). The linkage functional group is the alkenyl group. These CTBFs may be 
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available commercially, or may be prepared by a variety of known methods to those 
practiced in the art. 

CTBFs may be represented as follows, where as many as four CTBF fragments 
(A through D) may be present on the alkene functionality. 

5 



10 



10 



CTBF Part A 



CTBF Part B 



CTBF Part C 



CTBF Part D 



(b). Alternatively the library may be assembled with modified alkene BLFGs. 

(i). The alkene BLFGs may be prepared by performing a transition 
metal-mediated (typically Pd) catalyzed Heck reaction or related reactions on 
trisubstituted, disubstituted or the vinyl functionalized CTBFs [e.g., (de Meijere, A. et 
al., Angew. Chem. Int. Ed, EngL y 33:2379-2411 (1994) and references therein) and 
j 5 Heck, Palladium Reagents in Organic Synthesis, Academic Press, London 1985)]. R 8 
may be a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be 
appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 
20 R8 may a i S o be a aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). X may be a halide or a sulfonate group (OS0 2 R where R is 
substituted or unsubstituted alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-NQ 2 ). 



CTBF Part A 



25 CTBF Part B 



H c " c ' * — i °" A y e 

J H 1 CTBF PatlB \ >Q 



CTBF Part C 



(ii). The alkene BLFGs may be prepared by performing a transition 
metal (typically Ru or Mo) catalyzed olefin metathesis reaction [e.g., (Grubbs et al. 
Accounts of Chemical Research 28:446-452 (1995)) and (Schuster et al. Angewandte 
Chemie-Intemational Edition in English 36:2037-2056 (1997))]. R 8 - R 14 may be 
straight chain or branched alkyl groups of length 1 to 10, which may incorporate from 
1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 - R 14 may also be appended with 
up to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
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amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 - RH may 
also be aryl orTieteroaryl groups that are optionally substituted (alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono). 

5 | CTBP part A I P' R™ H" ^ I CTBF Part A L b" | CTBF PartA~~k P" 

J~R» + R» R" I CTBF PartB~7 ~V | CTBF Part B~T ~V» 



CTBF Part B 



11. Methods to Covalently Bond TBFs using the LFG with a Bifunctional Linker 
(BFL) to Produce Candidate Crosslinked Target Binding Molecules (CXL-TBFs). 



10 



TBF m 


—LFG + 


TBF n 


—LFG + 


BFL 



CXL-TBF 



Upon identification of the TBFs, crosslinking is accomplished with a BFL. All 
of the chemistry described for preparation of the BLFGs could be employed in the 
crosslinking step. One method would be to crosslink TBFs that have the same LFGs 
and employing the same crosslinking chemistry. A second method would be to 
J5 crosslink TBFs that have the same LFGs, but employing different crosslinking 
chemistry. A third method would be to crosslink TBFs that have different LFGs 
which is most but not all cases would require different corsslinking chemistry. Each 
of these strategies use known methods to those practiced in the art. Examples of the 
three methods are provided below. 
20 Method 1. For example, two TBFs with aldehyde LFGs could be crosslinked 

employing a BFL that incorporates two O-substituted hydroxylamines. 

QBfThcHO + rTiF7}-CHO + ^V£j^}-o' NH * ► G^^g=N Vo _^-_^,^C-£^ 



25 



10 



Method 2. For example, two TBFs with aldehyde LFGs could be crosslinked 
employing a BFL that incorporates one O-substituted hydroxylamine and one acyl 
hydrazide. 



TBF. 



Method 3. For example, one TBF with an aldehyde LFG and one TBF with an 
amine LFG could be crosslinked employing a BFL that incorporates one O-substituted 
hydroxylamine and one carboxylic acid. 



A hypothetical example for the preparation of heterolinkers would be to 
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employ amino acids as heterolinkers. As shown below, the amino acid heterolinker 
serves to link TBFs with carboxylic acid LFGs to TBFs with amine LFGs. The amine 
LFG could be a primary amine or a secondary amine (not shown). Many methods that 
are known to those practiced in the art could be used to prepare the CXL-TBFs using 
amino acid heterolinkers. One of the methods is described below. 



For this sample method, the amino acid BFL is protected as the N-tert- 
butoxycarbonyl (Boc) derivative. Many Af-Boc amino acids are commercially 
available, e.g., Novabiochem (San Diego, CA) and Neosystems (Strasbourg, France). 
N-Boc protected amino acids can also be prepared by known methods to those 
practiced in the art (Bodansky et al., The Practice of Peptide Synthesis, Springer- 
Verlag, Berlin, 1984, 18-20). 

The sample method is illustrated in Scheme 1. Sample experimental 
procedures are provided below. 

Step 1 (Desai, et al.. Tetrahedron Letters, 34, 7685*7688, (1993)) 

To a suspension of polymer-bound carbodimide (1.5 mmol) in chloroform (10 
mL) is added the TV-Boc protected amino acid 2 (0.55 mmol) and the amine-substituted 
TBF 1 (0.50 mmol). After the reaction mixture is shaken overnight at room 
temperature, the mixture is filtered. The resin is washed with chloroform (3 x 7.5 mL) 
and the combined filtrate is evaporated in vacuo to yield 3. 



To compound 3 (0.5 mmol) is added 5 mL of a solution of 4.0 M hydrochloric acid in 
dioxane (Aldrich, Milwaukee, WI). The solution is stirred for one hour at room 
temperature and then evaporated to remove the solvent and excess hydrochloric acid. 
The product is diluted with 5 mL of methanol and concentration is repeated to provide 
4. 

Step 3 (Desai. et al.„ Tetrahedron Letters. 34, 7685-7688, (1993)) 

To a suspension of polymer-bound carbodiimide (1.5 mmol) in chloroform (10 
mL)is added intermediate 4 (0.50 mmol) and the carboxylic acid-substituted TBF 5 
(0.55 mmol). After the reaction mixture is shaken overnight at room temperature the 
mixture is filtered. The resin is washed with chloroform (3 x 7.5 mL) and the 
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combined filtrate is evaporated in vacuo to yield product 6, which is the desired CXL- 
TBF. 

Scheme 1 

Step 1: 

w n 

BocHN — I Linker 



5 | TBF n + BOCHN- H Linker t -V^ O" 1 ^ 0 N ^ BocHN - 4T^nke71 ^ 



O 

2 3 



10 



Step 2: 



BocHN — | Linker f -^ 



TBF, 



□ 



4 M HCI in dioxane 



Cl H3N — | Ltnkerj —^ «- 



TBF„ 



Step 3: 

OO ^ x O 

4 

5 6 

20 EXPERIMENTAL 

Unless otherwise noted, materials were obtained from commercial suppliers 
and used without further purifications. Aldehydes were purchased from Aldrich 
Chemical Company, Inc. (Milwaukee, WI). Anhydrous dimethylsulfoxide (DMSO) 
and acetic acid were purchased from Fischer (Pittsburg, PA). Soluble CD4 (sCD4) 
25 was purchased from Intracel Corporation (Issaquah, WA), gp!20 and anti-gp!20 
antibody were purchased from DuPont (Wilmington, DE) and o-phenylenediamine 
peroxidase substrate tablet sets were purchased from Sigma Chemical Co. (St. Louis, 
MO). Reactions were carried out in commercially available Beckman 2 ml deep-well 
microtiter plates. 
K) EXAMPLE I 

Pharmacoph ore Recombination for the Identification of Compounds Capable of 
Inhibiting the Interaction Between gp!20 and CD4 
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To demonstrate the principle of pharmacophore recombination, we established 
a biochemical screen for the inhibition of gpl20-CD4 binding. This assay measures 
the ability of small molecules to inhibit the binding of gpl20 to sCD4 that is 
immobilized on a microtiter plate. Binding of sCD4 was quantified with an anti- 

5 gpl20 antibody conjugated to horseradish peroxidase. 

General procedure for the synthesis of an oxime compound library 

For several reasons, we chose to initially employ O-methyl oximes, rather than 
aldehydes, for the initial compound building block library. First, O-methyl oximes 
best model the pharmacophore units in the final oxime coupled dimers. Second, O- 

3 methyl oximes are more soluble in aqueous solution than their more hydrophobic 
aldehyde precursors. Also, the oxime functionality is clearly not inherently toxic and 
does not interfere with good pharmacokinetics or cell permeability since oximes are 
present in many drugs. Finally, the O-methyl oximes are easily prepared in a single 
step condensation of aldehydes with O-methyl hydroxylamine, without requiring 
purification of the resultant product. The chemical condensation of an aldehyde with 
O-methyl hydroxylamine to provide an oxime compound is shown in Figure 1. 

In the first step of the method, the initial oxime library was synthesized by 
separately condensing O-methyl hydroxylamine with 252 different aldehydes in a 
DMSO solution. The oxime library was prepared in a spatially separate fashion in a 
microtiter plate format such that each well contained a single oxime compound. More 
specifically, in each well of a microtiter plate, a DMSO solution of an unique aldehyde 
(0.188 ml, 0.15 M, 0.028 mmol) was added. To this solution, a DMSO solution of O- 
mcthyl hydroxylamine (0.083 ml, 0.5 M, 0.042 mmol) was then added followed by 
addition of a DMSO solution of acetic acid (0.023 ml, 0.5 M, 0.01 1 mmol). The plates 
were allowed to sit at room temperature overnight during which time condensation 
occurred, thereby providing the 252 member library of oxime compounds. 
Assay to determine which oxime compounds are capable of inhibiting the 
interaction between gpl20 and CD4 

The 252 member oxime compound library prepared as described above was 
then screened for the presence of compounds capable of inhibiting the interaction 
between gp!20 and CD4 in a standard ELISA assay. For the gpl20-CD4 ELISA 
assay, an ImmuIon-2 microtiter plate was incubated overnight at 4 °C with 70 ng of 
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sCD4 in 100 pil of carbonate buffer. The solution was removed from the plate and 
washed three tim&s with phosphate buffered saline (PBS) at pH 7.4. The plate was 
blocked with 150 PBS-Tween-BSA (0.5% BSA, 0.05% Tween-20) for 1 h at room 
temperature and then washed again. gpl20 (1 ng) in 50 jil of PBS and 50 (il of test 
organic oxime compound (3 mM), 40 ^1 PBS, 10 [il were added and incubated for 1 h 
at room temperature. The plate was then washed and 100 |al of anti-gpl20 conjugated 
horseradish peroxidase was added and incubated for I h at room temperature. The 
bound gpl20 was then quantitated with o-phenylenediamine as a substrate. 

The results of these assays demonstrated that 30 of the 252 oxime compounds 
were capable of inhibiting the interaction between gpl20 and CD4, wherein the 
approximate EC 50 values ranged from about 20 j^M to 500 pM. The structurally 
related aldehyde analogs of 30 of these oximes showed diverse structural motifs 
including chromones, phenols and furans (see Figure 2). 

Cross-linking of the top 30 structurally related aldehyde analogs to produce a 
library of candidate compounds and screening of those candidate compounds for 
the ability to inhibit the interaction between gpl20 and CD4 

Each of the 30 structurally related aldehydes analogs corresponding to the 30 
oxime compounds identified above as being capable of inhibiting the interaction 
between gpl20 and CD4 were individually coupled to each of the other 29 aldehydes 
with a variety of linkers to produce a library of candidate compounds for binding to 
the target molecule. Each of the 30 individual aldehydes was linked with another 
aldehyde through an CO'-diamino-alkanediol linker to obtain the library of cross- 
linked candidate compounds. Each aldehyde combination was kept spatially separate, 
but an equimolar mixture of five different 0,0'-diamino-alkanediol linkers were used 
in each coupling reaction to provide a 450-member library of cross-linked candidate 
compounds. The chemistry used for preparation of the compounds is shown in Figure 
3 showing the synthesis of both homodimers and heterodimers. 

The five linkers employed each consisted of two hydroxylamine groups 
tethered to an aliphatic chain having either two, three, four, five or six methylene units. 
This allowed us to evaluate any distance dependency the two pharmacophores may 
have in the binding site. Linkers were prepared as follows. To a round-bottomed flask 
was added alkyl dibromide (20.2 mmol), AMiydroxyphthalimide (36.8 mmol, -1.8 
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equiv) and dimethylformamide (90 ml). The flask was cooled to 0 °C and 1,8- 
diazabicyc!o[5.4.0]undec-7-ene (40.5 mmol) was added dropwise with stirring. The 
reaction was allowed to warm to room temperature and was then stirred overnight. 
The reaction mixture was then poured into 1M HC1 (500 ml). The resulting white 
5 solid precipitate was washed with water (3 x 50 ml) and methanol (3 x 50 ml) and sent 
onto the next step without further purification. 

13,9 mmol of the crude bis-TV-alkoxyphthalimide was then added to a round- 
bottomed flask in combination with dimethoxyethylene glycol (150 ml). To another 
flask was added hydrazine monohydrate (41.8 mmol, 3 equiv.) and dimethoxyethylene 
'0 glycol (100 ml). The suspension of bis-7V-alkoxyphthalimide was added slowly with 
stirring to the hydrazine solution. The flask was refluxed for 3 h, allowed to cool to 
room temperature and the resulting precipitate was filtered away. The remaining 
supernatant solution was concentrated and the resulting yellowish-oil was purified by 
Kugelrohr distillation (0.01 mm Hg, 60-70 °C) and column chromatography (89:9:2 
1 5 CHCl 3 /MeOH/NH 4 OH). 

For 0,0'-diamino-l,4-butanediol), the general synthesis procedure described 
above was followed. IR (film from CnfilJ: 3412.8,3310.0,2942.7,2866.3 cm 1 . 'H 
NMR (400 MHz, CDC1 3 ): 5 5.31 (br s, 4H), 3.65 (m, 4H), 1.57 (m, 4H). ,3 C NMR 
(400 MHz, CDC1 3 ): 5 75.5, 24.8. Anal. Calcd for C 4 H I2 0 2 N 2 : C, 39.99; H, 10.07; N, 

>0 

23.32. Found: C, 40.17; H, 9.90; N, 23.12. 

Once the five linkers were obtained, the 450 linked aldehyde combinations 
were prepared as follows. In each well, a DMSO solution of each of two different 
aldehydes (0.045 ml, 0.15 M, 0.007 mmol each) was added. To this solution, a DMSO 
solution of an equimolar mixture of the five linkers (0.025 ml, 0.3 M of each linker, 
0.007 mmol) was added followed by a DMSO solution of acetic acid (0.005 ml, 0.5 M, 
0.003 mmol). The plates were allowed to sit at room temperature overnight to allow 
for potential ligand formation. 

Each of the 450 members of the potential ligand library was then tested at a 
concentration of 100 jxM each (i.e., a concentration that is 10-fold more dilute than the 

0 

concentration employed in the initial oxime monomer screen) for the ability to inhibit 
the interaction between gp!20 and CD4 using the ELISA assay described above. The 
results from these assays demonstrated that more than 300 of the 450 members of the 
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potential ligand library showed greater than 50% inhibition activity at the 
concentration employed. When the 450 members of the potential ligand library were 
tested at a concentration of 1 ^iM each for the ability to inhibit the interaction between 
gpl20 and CD4, 17 of the ligands showed greater than 50% inhibitory activity at that 
5 concentration. The chemical structures of 12 of the 17 most active aldehyde 
precursors are shown in Figure 4. 
Evaluation of linker length dependence on activity 

The 17 linked aldehyde combinations with the greatest activity were then 
resynthesized as described above with a unique linker per well (85 separate wells) so 
) as to evaluate any linker-length dependency on binding. Screening each of the 85 
ligand compounds at 1 jjM each showed that ligands that incorporated pharmacophore 
1 (see Figure 5) were more potent than the other ligands and that there indeed was a 
dependence upon linker length. Specifically, ligands with linkers having either 4 or 5 
methylene units were much more active than ligands with linkers having 6 methylene 
units, whereas ligands with linkers having either 2 or 3 methylene units were only 
slightly more active than ligands with linkers having either 4 or 5 methylene units. 
Biological and Analytical Characterization of Representative Ligands 

One of the ligands that exhibited strong activity for inhibiting the interaction 
between gpl20 and CD4 (as shown in Figure 6) was resynthesized with each of the 
five different linkers on a large scale and was then purified by column 
chromatography. Column chromatography purification enabled isolation of the 
heterodimer from the homodimer. 

Large scale synthesis of organic oxime compounds was performed as follows. 
To a flame-dried round-bottomed flask was added aldehyde (0.82 mmol) and DMSO 
(8 ml). A 0.9 M 0-methyl hydroxylamine (1.4 ml) was then added and the reaction 
mixture was allowed to stir at room temperature overnight. The reaction was poured 
into methylene chloride (50 ml), washed with Hf> (3 x 20 ml), dried and concentrated. 
Silica gel chromatography provided the pure organic oxime compounds. 

The oxime compounds made by this method were characterized as follows. 
(1) O-methvl oxime of 6-nitropiperonal 

Reaction of 6-nitropiperonal with 0-methyl hydroxylamine provided 
predominantly one oxime isomer which was purified by silica gel chromatography 
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(10:90, EtOAc/hexanes). 'H NMR (400 MHz, CDC1 3 ): 5 8.61 (s, 1H), 7.53 (s, 1H), 
7.37 (s, 1H), 6.15 (s, 2H), 4.00 (s, 3H). Anal. Calcd for C 9 H 8 O s N 2 : C, 48.22; H, 3.60; 
N, 12.50. Found: C, 48:40; H, 3.75; N, 12.56. 

5 (2) O-methyl oxime of 6,8-dichloro-3-formylchromone 

Reaction of 6,8-dichloro-3-formylchromone provided 1:1 cis/trans isomers 
which were isolated by silica gel chromatography (20:80, CH^Cl/hexanes). 

Isomer 1: 'H NMR (400 MHz, CDCI 3 ): 5 8.48 (s, 1H), 8.24 (s, 1H), 8.12 (d, 1H, J = 
2.5), 7.75 (d, 1H, J = 2.5), 3.97 (s, 3H). Anal. Calcd for C II H 7 0 3 NC1 2 : C, 48.56; H, 
2.59; N, 5.15. Found: C, 48.44; H, 2.47; N, 5.03. 

Isomer 2: ! H NMR (400 MHz, CDC1 3 ): 8 9.45 (s, 1H), 8.12 (d, 1H, J = 2.5), 7.75 (d, 
2H, J = 2.5), 4.07 (3H). Anal. Calcd for C II H 7 0 3 NC1 2 : C, 48.56; H, 2.59; N, 5.15. 
Found: C, 48.45; H, 2.49; N, 5.11. 

Large scale synthesis of oxime ligands was performed as follows. To a flame- 
dried round-bottom flask was added 10 ml of DMSO and 1.03 mmol of each of the 
two aldehydes to be incorporated into the ligand. After all solids were dissolved, a 
solution of the appropriate linker (1.24 mmol) in 1 ml of DMSO was added dropwise, 
followed by the addition of acetic acid (0.72 mmol). The reaction mixture was 
allowed to stir at room temperature overnight. The reaction was then poured into 
methylene chloride (50 ml), washed with F^O (3 x 20 ml), dried and concentrated. 
Silica gel chromatography provided the isolated homo/heterodimers. Cis/trans 
isomers, when present, were not separated and were purified as mixtures of isomers. 
The oxime dimers made by this method were characterized below. 
(1) Oxime heterodimer of 6,8-dichloro-3-formvIchromone and 6-nitropiperonal, 
linker containing 4 methylene units 

The heterodimer was separated from the homodimers by silica gel 
chromatography (20:80, EtOAc/hexanes). The heterodimer was isolated and 
characterized as a 1:1 mixture of cis/trans isomers. Anal. Calcd for C 22 H l7 O g N 3 Cl 2 : C, 
50.59; H, 3.28; N, 8.05. Found: C, 50.70; H, 3.40; N, 7.89. 
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Isomer 1: 'H NMR (400 MHz, CDCI 3 ): 5 9.42 (s, 1H), 8.55 (s, 1H), 8.00 (s, 1H), 
7.67 (m, 2H); T.59-(s, 1H), 7.27 (s, 1H), 6.09 (s, 2H). 4.22 (m, 4H), 1.84 (m, 4H). 

Isomer 2: 'H NMR (400 MHz, CDC1 3 ): 5 8.54 (s, 1H), 8.41 (s, 1H), 8.16 (s, 1H), 
s 7.99 (s, 1H), 7.67 (s, 1H), 7.40 (s, 1H), 7.24 (s, 1H), 6.08 (s, 2H), 4.22 (m, 4H), 1.83 
(m, 4H). 



10 



15 



20 



(2) Oxime heterodimer of 6.8-dichloro-3-for m vlchromone and 6-nitrop iperonal, 
linker containine 5 methylene units 

The heterodimer was separated from the homodimers by silica gel 
chromatography (20:80, EtOAc/hexanes). The heterodimer was isolated and 
characterized as a 1.5:1 mixture of cis/trans isomers. Anal. Calcd for C 23 H„O g N 3 Cl 2 : 
C, 51.51; H, 3.57; N, 7.83. Found: C, 51.68; H, 3.70; N, 7.66. 

Isomer 1: 'H NMR (400 MHz, CDC1 3 ): 5 8.62 (s, 1H), 8.47 (s, 1H), 8.24 (s, 1H), 8.10 
(s, 1H), 7.74 (s, 1H), 7.73 (s, 1H), 7.37 (s, 1H), 6.14 (s, 2H), 4.20 (m, 4H), 1.8 (m, 
4H), 1.53 (m, 2H). 

Isomer 2: 'H NMR (400 MHz, CDCI 3 ): 5 9.47 (s, 1H), 8.61 (s, 1H), 8.1 1 (s, 1H), 7.75 
(s, 2H), 7.48 (s, 1H), 7.33 (s, 1H), 6.14 (s, 2H), 4.30 (t, 2H, J = 6.6), 4.20 (m, 2H), 
1.80 (m, 4H), 1.53 (m, 2H). 

These purified heterodimers and homodimers were then tested as described 
above for the ability to inhibit the interaction between gpl20 and CD4. The results of 
these assays demonstrated that heterodimers shown in Figure 6 having a linker 
containing from 2 to 5 methylene units exhibited EC S0 's ranging from 0.6 to 1.5 p.M 
25 and showing 10- to 20-fold enhancement in inhibitory activity over the compound 
shown in Figure 5 (EC,, in the range of about 10-15 uM). The other compound that 
was incorporated into the heterodimer had an EC 50 of greater than 50 \xM. 

The heterodimers shown in Figure 6 having linkers containing from 2 to 5 
methylene units are of comparable potency to the most potent compounds that have 
10 been identified to date that block the CD4/gpl20 interaction (Tanaka et al., J. 
Antibiotics 50:58 (1997), Sun et al., J. Antibiotics 49:689 (1997), Jarvest et al.. Bio. 
Med. Chem. Lett. 3:2851 (1993) and Chen et al., Proc. Natl. Acad. Sci. USA 89:5872 
(1992)). In addition, these ligand heterodimers are considerably less complex than 
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previously identified compounds with comparable activity. Further optimization of 
the optimal building block and linker combinations could presumably be accomplished 
by evaluating a larger range of linkers with enhanced rigidity or by incorporating 
analogs of the optimal aldehyde precursors. 
5 EXAMPLE 2 

Pharmacophore Recombination Using NJV-Dimethvlamines and Other Diamine 

Linkers 

In addition to the use of aldehydes and oximes for the pharmacophore 
recombination method as described above, additional chemistries also find use. In this 
10 example, the organic compound building blocks are N,N-dirnethylamine compounds 
that are prepared by reductive amination of starting aldehydes and dimethylamine 
using support-bound triacetoxyborohydride (Kaldor et aL, Tetrahedron Lett. 37:7193- 
7196 (1996)). The chemistry of these reactions is shown in Figure 7. Removal of the 
support-bound reducing agent by filtration followed by concentration to remove the 
15 volatile, excess dimethylamine then provides the pure N,N-dimethylamine monomer 
building blocks. Alternatively, the N,N-dimethylamine building blocks may be 
obtained by reduction using a sodium borohydride-based reducing agent in solution. 
The resulting amine product is then isolated from the excess reducing agent or 
aldehyde by passing down an acidic ion exchange column. The amine product is then 

20 obtained by elution from the ion exchange column with a volatile amine such as 
ammonia followed by concentration. 

Linkage of the N,N-dimethyIamine building blocks can be accomplished 
through the use of diamine linkers of which many are commercially available and 
many more can be readily prepared using well known methodology. The commercial 

25 availability of the diamine linkers allows rapid optimization of linker length, rigidity 
and orientation. An exemplary synthesis sequence is shown in Figure 8. Specifically, 
support-bound chloride (3) (or other support-bound halide) is treated with excess of a 
diamine to provide an amine-derivatized support (4). Acylation of the amine 
functionality then provides support-bound formamide or carbamate (5). Reduction 

3° then provides support-bound secondary amine (6). Reductive amination then 
introduces one of the pharmacophore elements (7). Acid treatment then releases a 
secondary amine from the support, which can then be treated with the second 
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pharmacophore monomer and sodium triacetoxyborohydride to provide the desired 
pharmacophore heterodimer (either support-bound reagent or alternative scavenging 
methods may be employed). Initial attachment of the diamine to the support can be 
accomplished using other support-bound alkyl halides or could be accomplished by 
5 reductive amination of a support-bound aldehyde or ketone. Fewer linkers are 
available that contain two secondary amine groups, but these can also be incorporated, 
in this case, the acylation step (4 to 5 in Figure 8) and the subsequent reduction step (5 
to 6 in Figure 8) would be eliminated. 

The foregoing description details specific methods which can be employed to 

10 practice the present invention.- Having detailed such specific methods, those skilled in 
the art will well enough know how to devise alternative reliable methods at arriving at 
the same information in using the fruits of the present invention. Thus, however, 
detailed the foregoing may appear in text, it should not be construed as limiting the 
overall scope thereof; rather, the ambit of the present invention is to be determined 

15 only by the lawful construction of the appended claims. All documents cited herein 
are expressly incorporated by reference. 



20 



25 



30 
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WHAT TS CLAIMED IS : 

1. A method for identifying a drug lead compound that binds to a target 
biological molecule (TBM) of interest, the method comprising: 

(a) assembling a library of candidate target binding fragments (CTBF) 
5 capable of being chemically cross-linked by a cross-linker to provide candidate cross- 
linked target binding fragments for binding to the target biological molecule; 

(b) screening the library of candidate target binding fragments to identify at 
least first and second candidate target binding fragments that bind to the target 
biological molecule; 

10 ( C ) chemically cross-linking the at least first and second candidate target 

binding fragments or structurally related analogs thereof with a cross-linker to provide 
a library of candidate cross-linked target binding fragments for binding to the target 
biological molecule; and 

(d) screening the library obtained in (c) to identify a drug lead compound 

1 5 that binds to the target biological molecule. 

2. The method according to Claim 1, wherein at least one of the candidate 
target binding fragments of the library of candidate target binding fragments binds to 
the target biological molecule with a K d of from about 5 mM to about 0.05 mM. 

20 

3. The method according to Claim 1, wherein at least one of the candidate 
target binding fragments of the library of candidate target binding fragments binds to 
the target biological molecule with a K d of from about 3 mM to about 0.1 mM. 

25 4 The method according to Claim 1, wherein the drug lead compound 

identified in step (d) binds to the target biological molecule with a K d of 500 nM or 
lower. 

5. The method according to Claim 1, wherein the screening steps (b) and 
(c) consist essentially of an in vitro biological assay. 
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6. The method according to Claim 1, wherein the library of candidate 
cross-linked target binding fragments for binding to the target biological molecule 
comprises homodimeric or heterodimeric candidate cross-linked target binding 
fragments. 

5 

7. The method according to Claim 1, wherein the library of candidate 
target binding fragments comprises candidate target binding fragments of less than 
about 500 daltons. 

10 8. The method according to Claim 1 wherein the library of candidate 

cross-linked target binding fragments comprises candidate cross-linked target binding 
fragments of less than about 750 daltons. 

9. The method according to Claim 1, wherein the target biological 
1 5 molecule is a human or human pathogen protein. 

10. The method according to Claim 9, wherein the protein is an enzyme, a 
human hormone, a human receptor and fragments thereof having nitrogen's in the 
protein present in their naturally occurring isotopic abundance. 

20 

11. The method according to Claim 1, wherein at least one of the screening 
steps (b) and (d) is accomplished by ELISA assay. 

12. A method for inhibiting the binding of a first biological molecule to a 
25 second biological molecule that binds to the first biological molecule, the method 

comprising: 

contacting a system comprising both the first and second biological molecules 
with a binding inhibitory amount of a drug lead compound identified according to the 
method of Claim 1, wherein the drug lead compound binds to the first biological 
10 molecule and inhibits its ability to bind to the second biological molecule. 
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13. The method according to Claim 12, wherein the first and second 
biological molecules are human proteins. 

14. The method according to Claim 13, wherein the first and second 
5 biological molecules are selected from the group; a human hormone, cytokine or 

chemokine, a human receptor and fragments thereof 

15. A method for identifying a drug lead compound that binds to a target 
biological molecule of interest, the method comprising: 

10 (a) assenbling a library of candidate target binding fragments, each 

fragment containing an oxime linking group; 

(b) screening the library of candidate target binding fragments or 
monomers to identify at least first and second oxime containing candidate target 
binding fragments that bind to the target biological molecule; 
15 ( c ) chemically crosslinking the aldehyde analogs of the at least first and 

second oxime containing candidate target binding fragments with an 0,0'-diamino- 
alkanediol cross-linker to provide a library of oxime containing candidate cross-linked 
target binding fragments for binding to the target biological molecule; and 

(d) screening the library obtained in (c) to identify a drug lead compound 
20 that binds to the target biological molecule. 

16. A method comprising: 

(a) assembling a library of candidate target binding fragments (CTBF), 
each fragment having a linkable functional group (LFG) or blocked form thereof 

25 (BLFG), the blocked form containing a linking group (LG); 

(b) contacting the candidate target binding fragments with a target 
biological molecule (TBM); 

(c) measuring a change in a first physical association (PA-1) of the target 
biological molecule; 

■™ (d) selecting target binding fragments (TBF) based on (c); 

(e) reacting selected target binding fragments having a linkable functional 
group with a cross-linker, having chemically compatible cross-reactive groups (CFG) 
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with the LFG, under conditions suitable for forming a library of candidate cross-linked 
target binding-fragments (CXL-TBF); 

(0 contacting the candidate cross-linked target binding fragments with the 
target biological molecule (TBM); 

5 (g) measuring a change in a second physical association (PA-2) of the 

target biological molecule; 

(h) selecting cross-linked target binding fragments (XL-TBF) based on (g). 

17. The method of Claim 16 wherein the candidate target binding fragment 
0 contacted with the TBM contains a blocked linkage functional group (BLFG) 

containing linking group LG. 

18. The method of Claim 17 where in the linking group (LG) in BLFG is 
selected from the group; oxime, hydrazone, N-acyl hydrazone, secondary amine, 

> tertiary amine, acetal, ketal, 1,2 amino alcohols, amide, N,N-disubstituted amides, 
thioamide, ureido, thioureido, carbamate, thiocarbamate, thiothiocarbamate, 
sulfonamide, carbonate, guanidino, amidino, thioester, ester, ether, 2-hydroxyether, 2- 
hydroxythioether, thioether, disulfide, alkane (alkylene), alkene (alkenylene) and 
alkyne (alkynylene). 

19. The method of Claim 18 wherein each candidate target binding 
fragment (CTBF) of step (b) contains the same linking group (LG) as is present in the 
candidate cross-linked target binding fragment (CXL-TBF) of step (f). 

20. The method of Claim 19 wherein the linking group (LG) is selected 
from the group; oxime, secondary amine, tertiary amine, amide, ureido, thioureido, 
sulfonamide and carbamate. 

21. The method of Claim 19 wherein two candidate target binding 
fragments (CTBF) selected from step (d) are cross-linked to form a candidate cross- 
linked target binding fragment (CXL-TBF) represented by 
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TBF m 


LCh— 


XL 


LG^— 


TBF n 



where 

TBF m is a first TBF selected from step (d) which contained LG, in its blocked 
linking group ; 

TBF n is a second TBF selected from step (d) which contained LG 2 in its 
blocked linking group; 

XL represents the cross-linker without the chemically compatible cross- 
reactive functional groups; 

LG, represents the linking group in the first TBF; and 

LG 2 represents the linking group in the second TBF. 



22. The method of Claim 21 wherein LG, and LG 2 are the same or are 
different and are selected from the group oxime, secondary amine, tertiary amine, 
amide, ureido, thioureido, sulfonamide and carbamate. 

23. The method of Claim 21 wherein TBF m and TBF n are the same or are 
different. 

23. The method of Claim 21 where XL is selected from the group of 
alkanes: methylene, ethylene, propylene, butylene, pentylene, hexylene and 
hepty!ene,optionally containing 0, 1, 2 or 3 ether linkages and from 1-3 double bonds 
and aryls: ortho-, meta- or para- C 0 -C 6 -alkyl-phenyl-C 0 -C 6 -alkylene . 

24. The method of Claim 16 wherein the candidate cross-linked target 
binding fragments are represented by the formula: 



TBF 



m 



J LGt— | XL | LGrg — | TBF n " 



30 where 



TBF m represents a first TBF selected from step (d); 
TBF (i represents a second TBF selected from step (d); 
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XL represents the cross-linker without the chemically compatible cross- 
reactive functional -groups selected from the group 
C 0 -C 10 -alkylene, 

C 0 -C 6 -alkyl-C 6 -C 10 -aryl-C 0 -C 6 -alkylene, 
5 C.-Q-alkyl-NCR.J-C.-CValkylene, 

(C.-Q-alkyl-O-C.-Q-alkylene),,, where n=l, 2, 3 or 4; 
LG, and LG 2 are linking groups independently selected from the group 
-C(R 0 )=N-O-, -0-N=C(R>, -CHj-NCR.)-, -N(R>CJL-, 
-C(=0)-N(R>, -N(R.)-C(=0)-, -N(R>C(=0)-0-, -0-C(=0)-N(R>, 
1 0 -N(R>C(=0>-N(R b )-, -N(R.)-C(=0)-N(R b )-, -SO r N(R.)- and 

-N(R>SCy; 

R.and R b are independently selected from the group 

hydrogen, C r C 10 -alkyl, C 0 -C 10 -alkyl-C 6 -C I0 -aryl, C 6 -C 10 -aryl-C 0 -C 10 - 
alkyl, C o -C 10 -alkyl-heterocycle-C o -C 10 -alkyl, C,-C 6 -alkyl-NH-C,-C 6 - 
alkyl, C 0 -C 10 -alkyl-O-C 0 -C l0 -alkyl, C 0 -C I0 aIkyl-C(=O)-C 0 -C, 0 -alkyl, C 0 - 
C 10 -alkyl-NH-C(=O)-C 0 -C 10 -alkyl,C 0 -C I0 -alkyl-O-C(=O)-C 0 -C, 0 - alkyl, 
where any alkyl, aryl or heterocycle is optionally substituted with C,-C 10 - 
alkyl, C,-C, 0 -alkoxy, C 6 -C 10 -aryl, C 6 -C 10 -aryloxy, halo (F, CI, Br, I), 

hydroxy, carboxy, amino, nitro and S(O) 03 . 



15 



20 



25 



30 



25. The method of Claim 24 wherein the TBF m and TBF n from step (d) 
each independently bind to the target biological molecule with a K, of from about 3 
mM to about 100 u,M. 

26. The method of Claim 25 wherein the TBF m and TBF„ from step (d) 
each independently bind to the target biological molecule with a K, of from about 2 
mM to about 500 U.M. 

27. The method according to Claim 24, wherein the target biological 
molecule is a human or human pathogen protein. 



- 90 - 



BNSDCCID: <WO 99493 14A1 I > 



WO 99/49314 



PCT/US99/06734 



28. The method according to Claim 24, wherein the protein is an enzyme, a 
human hormone or a human receptor having nitrogen's in their naturally occuring 
isotopic abundance. 

5 29. The method according to Claim 24, wherein at least one of steps (b) and 

(d) is accomplished by ELISA assay. 



30. The method of Claim 21 wherein 
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15 



20 
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TBF m 


— LGt- 


XL 


LGj— 


TBF n 



is selected from the group; 



TBF m L ^n4^^^>! 



TBF n 



TBF„ 



TBF„ 



TBF„ 



TBF„ 







— / O— XL 


- A - 

J V L 
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N , 
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— 1 ^ 


, ^ R jN— 1 XL 


R 2 


Rt 


"I ,R 2 


Ri B 2 


-l ,R 2 ■ 

o 



TBF„ 



TBF„ 



TBF n 



TBF rt 



TBF^ 



XL 



TBF„ 



TB F_ 



TBF„ 



TBF_ 



R, f 

1 R 2 



TBF„ 



l}J O* 
Rt 



r 

R 2 



i 

R t R 2 
O 



TBF n 



TBF n 



^T|— ^Q-| XL | — 



TBF_ 



TBF n 



TBF„ 



t I 

R, R 2 



TBFn. 



i i 

R, R 



TBF„ 
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TBF- 



TBF,, 



R 1 H 2 



XL 



R 2 



TBF„ 



TBF„ 



and 



R 2 



TBF„ 



31. The method of Claim 16 wherein the candidate cross-linked target 
binding fragments are represented by the formulae: 



10 



TBF_ 



)_ LGi -[~xl]— 



LGhj 



TBF n 



and 



15 



20 



25 



30 



TBF m -part A 



TBF n -part C 



LGg— | XL [ LG 4 



TBF m -part B 



TBF n -part D 



where 

TBF m represents a first TBF selected from step (d); 
TBF (i represents a second TBF selected from step (d); 

TBF , n -P art A and B represent TBF m from step (d) where each fragment is 
bonded to a single atom in LG 3 ; 

TBF r part C and D represent TBF n from step (d) where each fragment is bonded 
to a single atom in LG 4 ; 

XL represents a cross-linker of the formula 
-(Co-Cj-alkyl-L'-L'-L^L'-Co-Cj-alkyl)-; 

LG, and LG 2 are linking groups independently selected from the group 
-C(R )=N-0-, -0-N=C(R>, -CTVNCR.)-, -N(R ^CH,-, -C(=0)-N(R>, -N(R>C(=0)-,. 
-N(R>C(=0)-0-, -0-C(=0)-N(R>, -N(R>C(=0)-N(R b )-, -N(R a )-C(=0)-N(R b )-, - 
SO r N(R> and -N(R )-SO r ; 

LG, and LG 4 are linking groups independently selected from the group 
>C=N-0-, -0-N=C<, -CH 2 -N< > >N-CH,-, -C(=0)-N<, >N-C(=0)-, >N-C(=O)-0-, -O- 
C(=Q)-N<, >N-C(=0)-N(R b )-, -N(R>C(=0)-N< f -SO r N< and >N-SO r , where < and 
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> represent two bonds linking TBF-part A, B, C, or D to the single N or C atom in LG 3 
or LG 4 ; 

R a and R b are independently selected from the group 
hydrogen, C r C 10 -alkyl, C 0 -C 10 -alkyl-C 6 -C l0 -aryl, C 6 -C I0 -aryI-C 0 -C I0 -alkyI, C 0 -C I0 -aIkyl- 
heterocycle-C 0 -C I0 -alkyl, C r C 6 -alkyl-NH-C,-C 6 -alky], C 0 -C I0 -alkyl-O-C 0 -C 10 -alkyl, C 0 - 
C I0 alky]-C(=O)-C o -C 10 -alkyl f C 0 -C l0 -alkyI-NH-C(=O)-C 0 -C I0 -alkyl, C 0 -C l0 -alkyl-O- 
C(=O)-C 0 -C t0 -alkyl, where any alkyl, aryl or heterocycle is optionally substituted 
with C r C I0 -alky], C r C l0 -alkoxy, C 6 -C 10 -aryI, C 6 -C l0 -aryloxy, halo (F, CI, Br, I), 
hydroxy, carboxy, amino, nitro and S(0) a3 ; 

TBF m , TBF n , TBF m -part A, TBF m -part B, TBF-part C and TBF-part D are 
each independently represented by formula I 



-A-(Cycle l)-B-(Cycle 2)-E (I) 

Where 

Cycle 1 and Cycle 2 are independently present or absent and are selected from 
a mono-, bi-, or tricyclic saturated, unsaturated, or aromatic ring, each ring having 5, 
6 or 7 atoms in the ring where the ring atoms are carbon or from 1-4 heteroatoms 
selected from; nitrogen, oxygen, and sulfur, and where any sulfur ring atom may 

optionally be oxidized and any carbon ring atom may form a double bond with O, NR n 
1 1 * 

and CR R , each ring nitrogen may be substituted with R n and any ring carbon may 
be substituted with R^; 

A and B are independently selected from 



■L 3 — L 2 


L 1 


■L 4 — L 3 - 


L L and 


-L 5 — L 4 


-L 3 L 2 — L 1 — 



where: 

L 1 is absent or may be selected from oxo (O), S(0) s> C(=0), C(=N-R n ), 

C^CRV), CO^R 1 '), C(R l ), C, het, N(R n ) or N; 
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L 2 is absent or may be selected from oxo (O), S(0) s , C(=0), C(=N-R n ), 

C(=CR 2 R 2 ), C(R 2 R 2 ), C(R 2 ), C, het, N(R n ) or N; 

L 3 is absent or may be selected from oxo (O), S(O) , C(=0), C(=N-R n ), 

s 

C(=CR 3 R 3 '), C(R 3 R 3 ') C(R 3 ), C, het, N(R n ) or N; 

L 4 is absent or may be selected from oxo (O), S(0) g , C(=0), C(=N-R n ), 

C(=CR 4 R 4 ), C(R 4 R 4 '), C(R 4 ), C, NR n or N; and 

L 5 is absent or may be selected from oxo (O), S(0) s> C(=0), C(=N-R n ), 

C(R 5 R 5 '), C(=CR 5 R 5 ), C(R 5 ), C, NR n or N; 

R 1 , R 1 , R 2 , R 2 , R 3 , R 3 , R 4 , R 4 , R 5 and R 5 each are independently 

selected from R a , R a ', R c and U-Q-V-W; where s is 0-2 

Optionally, each R J -R 5 or NR n together with any other R l -R 5 or NR n may 
form a mono-, bi-, or tricyclic saturated, unsaturated, or aromatic ring, each ring being 
a homo- or heterocycle having 5, 6 or 7 atoms in the ring, optionally each ring 
containing 1-4 heteroatoms selected from N, O and S where any ring carbon or sulfur 

atom may optionally be oxidized, each ring nitrogen optionally substituted with R n 

and each ring carbon optionally substituted with R^; 

Eis-L J -L 2 -L 3 -R a ; 

R is selected from the group; hydrogen, halo(F, CI, Br, I), halo(F, CI, Br, I)- 
Cj-Cj jalkyl, halo(F, CI, Br, I)-Cj-Cj jalkoxy, hydroxy-Cj-Cj ^alkyl, cyano, 

isocyanate, carboxy-C ^-C ^ ^alkyl, amino, Cq-Cj jalkyl-amino-(Cj-Cgalkyl), C^- 

C alkyl-amino-di-(C . -C c alkyl), aminocarbonyl, C - -C . . alkylcarbonylamino, 

II I o 111 

carboxamido, carbamoyl, carbamoyloxy, formyl, formyloxy, azido, nitro, hydrazide, 
hydroxamic acid, imidazoyl, ureido, thioureido, thiocyanato, hydroxy, C^-C^alkoxy, 

mercapto, sulfonamido, het, phenoxy, phenyl, benzyl, benzyloxy, benzamido, tosyl, 
morpholino, morpholinyl, piperazinyl, piperidinyl, pyrrolinyl. imidazolyl and indolyl; 
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R a is selected from the group of C 0 -Cj 0 alkyl-Q-C 0 -C 6 alkyl, Cg-C^alkenyl- 
Q-C 0 -C 6 alkyl, CQ-C^alkynyl-Q-C^Cgalkyl, C^Cj jCycloalkyl-Q-C^C^lkyl, C^ 
C 10 cycloalkenyl-Q-C 0 -C 6 alkyl, C^C^lkyl-^-C^aryl-Q-C^Cgalkyl, C 6 -C 1() aryl- 
Cj-^alkyl-Q-C^Cgalkyl, C 0 -C 6 aIkyl-het-Q-C 0 -C 6 aIkyI, C 0 -C 6 alkyl-Q-het-C 0 - 
C 6 alkyl, het-C 0 -C 6 alkyl-Q-C 0 -C 6 alkyl, C 0 -C 6 alkyl-Q-C 6 -C 12 aryl and Q-C^Cgalky, 

where any aryl or het is optionally substituted with 1-3 R d and any alkyl, alkenyl or 
alkynyl is optionally substituted with 1-3 R ; 



15 



a a 

R and R may join to form a 3-7 member homocyclic ring substituted with 1- 



3R a ; 



R c is selected from hydrogen and substituted or unsubstituted; amino, O-C^- 
C g alkyl, amino-CCj-Cgalkyl), amino-di-CCj-Cgalkyl), Cj-C^alkyl, C 2 -C 1() alkenyl, 
C 2 -C 1() a]kynyl, Cj-Cj jCycloalkyl, C 3 -C 1() cycloalkenyl, Cj-Cgalkyl-C^C^aryl, 
C 6 -C 1() aryl-C 1 -C 6 alkyl, Cj-Cgalkyl-het, het-Cj-Cgalkyl, Cg-C^aryl and het, where 

the substituits on any alkyl, alkenyl or alkynyl are 1-3 R a and the substituents on any 
20 aryl or het are 1-3 R d ; 

R d is selected from R h and R p ; 

R h is selected from the group OH, OCF^ OR c , SR m , halo(F, CI. Br, I), CN, 
isocyanate, N0 2> CF 3> C 0 -C 6 alkyl-NR n R n \ C 0 -C 6 alkyl-C(=O)-NR n R n ', C Q - 

25 

C 6 alkyl-C(=0)-R a , Cj-Cgalkyl, Cj-Cgalkoxy, C 2 -C g alkenyl, C 2 -C g alkynyl, C 3 - 

C^cycloalkyl, C^-C^cycloalkenyl, Cj-C^alkyl-phenyl, phenyl-C j-C^alkyl, C^- 

C^alkyloxycarbonyl, phenyl-CQ-C^alkyloxy, Cj-C^alkyl-het, het-Cj-C^alkyl, SO^- 

30 het, 0-C 6 -C 12 aryl, SO^C^C^aryl, SO^Cj-C^alkyl and het, where any alkyl, 

alkenyl or alkynyl may optionally be sustituted with 1-3 groups selected from OH, 
halo(F, CI, Br, I), nitro, amino and aminocarbonyl, where the substituents on any aryl 
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or het are 1-2 hydroxy, halo(F, CI, Br, I), CF r C^^alkyl, Cj-C^lkoxy, nitro and 
amino; 

R m is selected from hydrogen, S-Cj-Cgalkyl, C(=0)-Cj-C 6 alkyl, C(=0)- 
n n ' 

5 NR R .Cj-CgalkyI, halo(F, CI, Br, O-Cj-CgalkyI, benzyl and phenyl; 

R n is selected from the group R c , OH, OCF^ OR°, CN, isocyanate, NH- 
C(=0)-0-R c , NH-C(=0)-R c , NH-C(=0)-NHR c , NH-SCyR 5 , NH-S0 2 -NH-C(=0)- 
]0 R c , NH-C(=0)-NH-S0 2 -R s , C(=0)-0-R°, C(=0)-R c , C(=0)-NHR c , C(=0)-NH- 
C(=0)-0-R°, C(=0)-NH-C(=0)-R c , C(=0)-NH-SO_-R s , C(=0)-NH-S0 o -NHR c , 
S0 2 -R s , S0 2 -0-R°, S0 2 -N(R c ) 2 , S0 2 -NH-C(=0)-0-R°, S0 2 -NH-C(=0)-0-R° and 

SO -NH-C(=0)-R C ; 

] 5 ^* 

R° is selected from hydrogen and substituted or unsubstituted C -C alkyl, C - 

16 0 

C 6 alkyl-C 6 -C 1() aryl, Cj-Cgalkylcarbonyl, C 2 -C 6 alkenyl, C^Cgalkynyl, C y 
C g cycloalkyl and benzoyl, where the substituits on any alkyl are 1-3 R a and the 

20 

substituents on any aryl are 1-3 R* 5 ; 

R p is selected from the group; OH, halo(F, CI. Br, I), CN, isocyanate, OR°, 
SR m , SOR°, N0 2 , CF 3 , R c , NR n R n ', N(R n )-C(=0)-0-R°, N(R n )-C(=0)-R c , C Q - 

25 C 6 alkyl-S0 2 -R s , C 0 -C 6 alkyl-SO 2 -NR n R n ', C(=0)-R c , 0-C(=0)-R c , C(=0)-0-R° 

n n 9 

and C(=0)-NR R , where the substituits on any alkyl, alkenyl or alkynyl are 1-3 R a 

and the substituents on any aryl or het are 1-3 R d ; 

R s is a substituted or unsubstituted group selected from; C,-C 0 alkyl, C-- 
.10 1 8 2 

Cgalkenyl, C 2 -C g alkynyl, C 3 -C g cycloalkyl, C^cycloalkenyl, C^alkyl-phenyl, 

phenyl-C 0 -C 6 alkyl, C 0 -C 6 alkyl-het and het-C 0 -C 6 alkyl, where the substituits on any 
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15 



alky], alkenyl or alkynyi are 1-3 R a and the substituents on any aryl or het are 1-3 R d ; 

het is any mono-, bi-, or tricyclic saturated, unsaturated, or aromatic ring where 
at least one ring is a 5-, 6- or 7-membered ring containing from one to four 
heteroatoms selected from the group nitrogen, oxygen, and sulfur, the 5-membered 
5 ring having from 0 to 2 double bonds and the 6- or 7-membered ring having from 0 to 
3 double bonds and where any carbon or sulfur atoms in the ring may optionally be 
oxidized, and where any nitrogen heteroatom may optionally be quatemized and where 
any ring may contain from 0-3 R d ; 
^ U is an optionally substituted bivalent radical selected from the group; C - 

C 6 alkyl, C 0 -C 6 alkyl-Q, C^alkenyl-Q, and 0,-C^alkynyI-Q, where the substituits 
on any alkyl, alkenyl or alkynyi are 1-3 R a ; 

. Q is absent or is selected from the group; -O-, -S(0) g -, -S0 2 -N(R n )-, -N(R n )-, 
-N(R n )-C(=0)-, -N(R n )-C(=0)-0-, -N(R n )-S0 2 -, -C(=0)-, -C(=0)-0-, -het-,-C(=0)- 

N(R n )-, -PO(OR c )0- and -P(0)0-, where s is 0-2 and the heterocyclic ringis 
substituted with 0-3 R h ; 

V is absent or is an optionally substituted bivalent group selected from Cj- 
C 6 alkyl, C 3 -C 8 cycloalkyl, C^alkyl^g-C^aiyl. and Chalky-net, where the 
substituits on any alkyl are 1-3 R a and the substituents on any aryl or het are 1-3 R d ; 

W is selected from the group; hydrogen, -OR°, -SR m , -NR n R n ', -NH-C(=0)- 

25 0-R°, -NH-C(=0)-NR n R n ', -NH-C(=0)-R c , -NH-SO.-R 8 , -NH-SCL-NR n R n -NH- 

£ 2 

S0 2 -NH-C(=0)-R c , -NH-C(=0)-NH-S0 2 -R s , -C(=0)-NH-C(=O)-0-R°, -C(=0)-NH- 
C(=0)-R c , -C(=0)-NH-C(=0)-NR n R n ', -C(=0)-NH-S0 9 -R s , -C(=0)-NH-SO - 
3() NR n R n ', -C(=S)-NR n R n ', -S0 2 -R s , -S0 2 -0-R°, -S0 2 -NR n R n ', -S0 2 -NH-C(=0)-0- 
R°, -S0 2 -NH-C(=0)-NR n R n ', -S0 2 -NH-C(=0)-R c . -0-C(=0)-NR n R n ', -0-C(=0)- 
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R c , -0-C(=0)-NH-C(=0)-R c , -0-C(=0)-NH-S0 2 -R s and -O-SCyR 5 ; 

Optionally, TBF m -part A together with TBF ni -part B and TBF n -part C together 
with TBF n -part D may independently form (Cycle 1) substituted with -B-(Cycle 2)-E.. 

5 

31 A compound made by the method of Claim 30. 



10 



15 



20 



25 



30 
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FIGURE 9 
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PHARMACOPHORE RECOMBINATION FOR THE 
IDENTIFICATION OF SMALL MOLECULE DRUG LEAD 

COMPOUNDS 

BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

The present invention is directed to novel methods for identifying small molecule 
drug lead compounds. 

5 This invention was made with Government support under Contract No. R01 

GM50353 awarded by the National Institutes of Health. The Government has certain 
rights to this invention. 

2. Description of the Related Art 

In response to the ever increasing demand for novel compounds useful in the 

10 effective treatment of various maladies, the medical research community has developed a 
number of different strategies for discovering and optimizing new therapeutic drugs. 
For the most part, these strategies are dependent upon molecular techniques that allow 
the identification of tightly binding ligands for a given biological target molecule. Once 
identified, these ligands may then carry out their therapeutic functions by activating, 

15 inhibiting or otherwise altering the activity of the molecular target to which they bind. 

In one such strategy, new therapeutic drugs are identified by screening 
combinatorial libraries of synthetic small molecule compounds, determining which 
compound(s) have the highest probability of providing an effective therapeutic and then 
optimizing the therapeutic properties of the identified small molecule compound(s) by 

20 synthesizing structurally related analogs and analyzing them for binding to the target 
molecule (Gallop et al., J. Med. Chem. 37:1233-1251 (1994), Gordon et al., J. Med. 
Chem. 37:1385-1401 (1994), Czarnik and Ellman, Acc. Chem. Res. 29:112-170 (1996), 
Thompson and Ellman, Chem. Rev. 96:555-600 (1996) and Balkenhohl et al., Angew. 
Chem. Int. Ed. 35:2288-2337 (1996)). However, this process is not only time 

25 consuming and costly, it often does not provide for the successful identification of a 
small molecule compound having sufficient therapeutic potency for the desired 
application. For example, while the preparation and evaluation of combinatorial 
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libraries of small molecules has proven somewhat useful for new drug discovery, the 
identification of small molecules for difficult molecular targets (e.g., such as those 
useful for blocking or otherwise taking part in protein-protein interactions) has not been 
particularly effective (Brown, Molecular Diversity 2:217-222 (1996)). 

One issue that limits the success of combinatorial library approaches is that it is 
possible to synthesize only a very small fraction of the possible number of small 
molecules. For example, greater than 10 60 different small molecules having valid 
chemical structures and molecular weights under 600 daltons can be envisioned. 
However, even the most ambitious of small molecule combinatorial library efforts have 
been able to generate libraries of only tens to hundreds of millions of different 
compounds for testing. Therefore, combinatorial technology allows one to test only a 
very small subset of the possible small molecules, thereby resulting in a high probability 
that the most potent small molecule compounds will be missed. Thus, suitable small 
molecule compounds having the required availability, activity or chemical and/or 
structural properties often cannot be found. Moreover, even when such small molecule 
compounds are available, optimization of those compounds to identify an effective 
therapeutic often requires the synthesis of an extremely large number of structural 
analogs and/or prior knowledge of the structure of the molecular target for that 
compound. Furthermore, screening large combinatorial libraries of potential binding 
compounds to identify a lead compound for optimization can be difficult and time- 
consuming because each and every member of the library must be tested. It is evident, 
therefore, that novel methods for rapidly and efficiently identifying new small molecule 
drug leads are needed. 

Living organisms evolve through a process that includes both (1) genetic 
recombination, where sexual reproduction acts to mix and recombine the attributes of the 
parent organisms to provide progeny having attributes of both parents, and (2) natural 
selection, where only those progeny that are sufficiently "fit" are capable of passing 
their attributes on to the next generation. Approaches that closely model the process by 
which organism evolve have previously been reported for identifying small molecules 
that bind to receptors and enzymes (Weber et al., Angew. Chem. Int. Ed. Engl. 34:2280- 
2282 (1995) and Singh et al., J. Am. Chem Soc. 118:1669-1676 (1996)). These 
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approaches are based upon the mathematical method termed "genetic algorithms" 
(Holland, Sci. Am. 66-72 (1992)). Using genetic algorithms, a population of different 
compounds is screened to identify the compounds that bind to the receptor or enzyme 
(i.e., the "fittest" compounds). A population of progeny compounds is then prepared by 
5 recombining the building blocks that were used to prepare the "fittest" compounds. A 
screen is then performed to identify the compounds that bind to the target with the 
highest affinity, which are made up of the optimal building block combinations. 

However, because the building blocks employed in the genetic algorithm 
approach are not preselected, one of two techniques are used to identify tight binding 

10 ligands: (1) extremely large populations of compounds must be screened and 
recombined, or (2) multiple rounds of screening and recombination are performed on 
relatively small populations where additional building blocks are gradually introduced 
through a process that is analogous to genetic mutation. In this second approach, many 
rounds of selection, recombination and building block introduction are required to 

15 identify the optimal building block combinations in analogy to the many rounds of 
selection, reproduction and mutation that are required in the evolution of living 
organisms. Thus, the use of genetic algorithms is currently limited because of the large 
amount of time required for compound preparation and screening, wherein the goal of 
new drug discovery is to identify a potent compound as quickly as possible. 

20 Another recently reported approach for identifying high affinity ligands for 

molecular targets of interest is by determining structure-activity relationships from 
nuclear magnetic resonance analysis, i.e., "SAR by NMR" (Shuker et al., Science 
274:1531-1534 (1996) and U.S. Patent No. 5,698,401 by Fesik et al.). In this 
approach, the physical structure of a target protein is determined by NMR and then 

25 small molecule building blocks are identified that bind to the protein at nearby points on 
the protein surface. Adjacently binding small molecules are then coupled together with a 
linker in order to obtain compounds that bind to the target protein with higher affinity 
than the unlinked compounds alone. Thus, by having available the NMR structure of the 
target protein, the lengths of linkers for coupling two adjacently binding small molecules 

30 can be determined and small molecule ligands can be rationally designed. This approach 
has been useful for identifying compounds that bind to FK506 binding protein with a 
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= 20 nM (Shuker et aL, supra) and to stromelysin with a K d = 15 nM (Hajduk et al., 7. 
Am. Chem. Soc. 119:5818-5827 (1997) and Hajduk et al., 7. Am. Chem. Soc. 119:5828- 
5832 (1997)). 

However, while the SAR by NMR method is powerful, it also has serious 
5 limitations. For example, the approach requires huge amounts of target protein (>200 
mg) and this protein typically must be 15 N-labeIed so that it is useful for NMR studies. 
Moreover, the SAR by NMR approach usually requires that the target protein be soluble 
to >0.3 mM and have a molecular weight less than about 25-30 kDa. Additionally, the 
structure of the target protein is first resolved by NMR, a process which often can 

10 require a 6 to 12 month time commitment. 

From the above, it is evident that there is a need for novel techniques useful for 
rapidly and efficiently identifying small molecule drug lead compounds that are capable 
of binding with high affinity to a molecular target of interest. We herein describe for the 
first time a method which is based upon pharmacophore recombination, wherein a 

15 population of small molecule pharmacophores are "pre-selected" for the ability to bind 
to a molecular target and wherein the small molecule pharmacophores that bind with the 
highest affinity are then chemically linked in various combinations to provide a library 
of potential high affinity binding ligands. The library of potential binding ligands is then 
screened using a simple functional assay for the presence of one or more compounds that 

20 bind to the target molecule with very high affinity. 

SUMMARY OF THE INVENTION 

Applicants herein describe a molecular approach for rapidly and efficiently 
identifying small molecule ligands that are capable of binding to a target biological 
molecule with high affinity, wherein ligand compounds identified by the method are 

25 useful as new small molecule drug lead compounds. The herein described methods 
allow a library of only the most favorable compounds to be assayed for binding to a 
target biological molecule without the need for screening all possible small molecule 
compounds and combinations thereof for binding to the target as is required in standard 
combinatorial library approaches. More specifically, a library of candidate target 

30 binding fragments is- assembled and subjected to a first screen or "pre-screened" to 
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identify a subset of that library that bind to a target biological molecule with or below a 
certain dissociation constant. Those candidate target binding fragments identified during 
this "pre-screening" step as being capable of binding to the target biological molecule 
are then coupled or cross-linked in a variety of combinations using one or more linker 
5 elements to provide a library of potential high affinity binding ligands or candidate 
cross-linked target binding fragments, whose building blocks represent the small 
candidate target binding fragments having the highest affinity for the target biological 
molecule as identified in the "pre-screening" step. The library of potential ligands or 
candidate cross-linked target binding fragments for binding to the target biological 

10 molecule is then screened a second time to identify those members that exhibit the lowest 
dissociation constant for binding to the target biological molecule. Because the library 
of candidate target binding fragment building blocks is initially "pre-screened" to select 
for a much smaller set of the most favorable building blocks, the most productive 
building block and cross-linker combinations can be identified without the laborious task 

15 of screening all possible combinations of all building blocks coupled together by a set of 
linkers. The process of identifying high affinity drug lead compounds is therefore, 
greatly expedited. 

With regard to the above, one embodiment of the present invention is directed to 
a method for identifying drug lead compounds that bind to a biological target molecule 
20 of interest, wherein the method comprises the steps of: 

(a) Assembling a library of candidate target binding fragments (CTBF) 
capable of being chemically cross-linked by a cross-linker element to provide candidate 
cross-linked target binding fragments for binding to the target biological molecule; 

(b) screening the library of candidate target binding fragments to identify at 
25 least first and second candidate target binding fragments which bind to the target 

biological molecule; 

(c) chemically cross-linking the at least first and second candidate target 
binding fragments or structurally related analogs thereof with a cross-linker element to 
provide a library of candidate cross-linked target binding fragments for binding to the 

30 target biological molecule; and 
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(d) screening the library obtained in (c) to identify a drug lead compound that 
binds to the target biological molecule. 

In various preferred embodiments, the library of candidate target binding 
fragments may comprise compounds of less than 500 daltons, may comprise simple 
5 aldehydes, amines, amides, carbamates, ureidos, sulfonamides, alcohols, carboxylic 
acids, thiols, aryl halides, alkenes, alkynes, ketones, ethers and/or oximes and/or may 
bind to the target biological molecule with a K d of 10 mM or lower. In a particularly 
preferred embodiment, the library or candidate target binding fragments may comprise 
oxime compounds, wherein the structurally related aldehyde analogs of those oxime 

10 compounds are capable of being chemically cross-linked via an 0,CT-diamino-alkanedioI 
cross-linker. Target biological molecules that find use in the described methods include, 
for example, proteins, nucleic acids and saccharides, preferably proteins. Preferred 
TBM's include human or human pathogen proteins, especially enzymes, human 
hormones, human receptors and fragments thereof. These TBM's may all contain atoms 

15 of naturally occuring isotopic abundance. 

In other preferred embodiments, the library of candidate cross-linked target 
binding fragments comprises candidate cross-linked target binding fragments of less than 
about 1000 daltons, that may be homo- or heterodimeric having a for the TBM of 
from about 500 \xM to about 500 nM or lower. 

20 Another embodiment of the present invention is directed to a method for 

inhibiting the interaction between first and second biological molecules, wherein the 
method comprises the step of contacting a system comprising both the first and second 
biological molecules with a binding inhibitory amount of a candidate cross-linked target 
binding fragment identified by the above described method, wherein the candidate cross- 

25 linked target binding fragment binds to one of the first or second biological molecules 
and inhibits their ability to interact. 

A further embodiment of the present invention is directed to a drug lead 
compound made by the the method described herein, where the compound is represented 
by the formula: 



30 



TBF„ 



■LGt— XL LGt— 



TBF n 



and 
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TBF m -part A TBF n -part C 



LGt - XL — LG 



TBF m -part B 



TBF n -part D 



where 

TBF m represents a first TBF selected from step (d); 
TBF n represents a second TBF selected from step (d); 
5 TBF m -part A and B represent TBF m from step (d) where each fragment is bonded 

to a single atom in LG 3 ; 

TBF n -part C and D represent TBF n from step (d) where each fragment is bonded 
to a single atom in LG 4 ; 

XL represents a cross-linker of the formula 
10 -(Co-C.-alkyl-L^L^L'-L'-L'-Co-C.-alkyl)-; 

LGj and LG 2 are linking groups independently selected from the group - 
C(R a ) = N-0-, -0-N = C(R a )-, -CH 2 -N(RJ-, -N(R a )-CH 2 -, -C^O)-^)-, -N(R a )- 
C< = 0)-, -N(R a )-C( = 0)-0-, -0-C( = 0)-N(R a K -N(R 3 )-C( = 0)-N(R b )-, -N(R a )-C(=0)- 
N(R b )-, -S0 2 -N(R a )- and -N(R a )-SO r ; 
15 LG 3 and LG 4 are linking groups independently selected from the group >C = N- 

O-, -0-N = C<, -CH : -N<, >N-CH r , -C( = 0)-N<, >N-C( = OK >N-C( = 0)-0-, - 
0-C( = 0)-N<, >N-C( = 0)-N(R b )-, -N(RJ-C( = 0)-N<, -SO r N< and >N-S0 2 -, 
where < and > represent two bonds linking CTBF-part A, B, C, or D to the single N 
or C atom in LG 3 or LG 4 ; 
20 Ra and R b are independently selected from the group 

hydrogen, C r C J0 -alkyl, C 0 -C 10 -alkyl-C 6 -C I0 -aryl, C 6 -C ]0 -aryl-C 0 -C I0 -alkyl, C 0 -C 10 -alkyI- 
heterocycIe-C 0 -C 10 -alkyl, C r C 6 -alkyl-NH-C r Q-alkyl, C 0 -C, 0 -alkyl-O-C 0 -C 10 -alkyl, C 0 - 
C 10 alkyl-C( = O)-C 0 -C ]0 -alkyl, C 0 -C 10 -alkyl-NH-C( = O)-C 0 -C 10 -alkyl, C 0 -C 10 -alkyl-O- 
C( = O)-C 0 -C, 0 -alkyl, where any alkyl, aryl or heterocycle is optionally substituted with 
25 C r C 10 -alkyl, C r C 10 -alkoxy, C 6 -C 1(r aryl, C 6 -C 10 -aryloxy, halo (F, CI, Br, I), hydroxy, 
carboxy, amino, nitro and S(OV 3 ; 
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TBF m , TBF n , TBF nr part A, TBF m -part B, TBF n -pan C and TBF n -part D are each 
independently represented by formula I 

-A-(Cycle l)-B-(Cycle 2)-E (I) 

Where 

5 Cycle 1 and Cycle 2 are independently present or absent and are selected from a 

mono-, bi-, or tricyclic saturated, unsaturated, or aromatic ring, each ring having 5, 6 
or 7 atoms in the ring where the ring atoms are carbon or from 1-4 heteroatoms selected 
from; nitrogen, oxygen, and sulfur, and where any sulfur ring atom may optionally be 

oxidized and any carbon ring atom may form a double bond with O, NR n and CR 1 !* 1 ', 

10 each ring nitrogen may be substituted with R n and any ring carbon may be substituted 

with R d ; 



A and B are independently selected from 



15 




where: 

L 1 is absent or may be selected from oxo (O), S(O) , C( = 0), C( = N-R n ), 

C( = CR 1 R 1 '), C(R ] R r ), CCR 1 ), C, het, N(R n ) or N; 
2 

L is absent or may be selected from oxo (O), S(0) s> C(=0), C( = N-R n ), 

20 C( = CR 2 R 2 '), C(R 2 R 2 '), C(R 2 ), C, het, N(R n ) or N; 

3 

L is absent or may be selected from oxo (O), S(0) s , C( = 0), C( = N-R n ), 

C( = CR 3 R 3 '), C(R 3 R 3 ') C(R 3 ), C, het, N(R n ) or N; 
4 

L is absent or may be selected from oxo (O), S(0) s , C( = 0), C( = N-R n ), 
C(=CR 4 R 4 '), C(R 4 R 4 '), C(R 4 ), C, NR n or N; and 
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L 5 is absent or may be selected from oxo (O), S(0)^ C( = 0), C( = N-R n ), 

C(R 5 R 5 '), C( = CR 5 R 5 ), C(R 5 ), C, NR n or N; 

rU 1 , R 2 , R 2 , R 3 , R 3 \ R 4 , R 4 \ R 5 and R 5 ' each are independently selected 

from R a , R a \ R c and U-Q-V-W; where s is 0-2 

5 Optionally, each R*-R~* or NR n together with any other R*-R^ or NR n may form 

a mono-, bi-, or tricyclic saturated, unsaturated, or aromatic ring, each ring being a 
homo- or heterocycle having 5, 6 or 7 atoms in the ring, optionally each ring containing 
1-4 heteroatoms selected from N, O and S where any ring carbon or sulfur atom may 

optionally be oxidized, each ring nitrogen optionally substituted with R n and each ring 

10 carbon optionally substituted with R^; 

E is -L ] -L 2 -L 3 -R a ; 

R a is selected from the group; hydrogen, halo(F, CI, Br, I), halo(F, CI, Br, I)- 
Cj-Cj jalkyl, halo(F, CI, Br, IK^-Cj ^lkoxy, hydroxy-C ] -C 1 jalkyl, cyano, 

isocyanate, carboxy-C^-Cj jalkyl, amino, CQ-C^alkyl-amino-CCj-Cgalkyl), C^- 

15 Cj 1 alkyl-amino-di-(C 1 -C g alkyl), aminocarbonyl, C^-C^ jalkylcarbonylamino, 

carboxamido, carbamoyl, carbamoyloxy, formyl, formyloxy, azido, nitro, hydrazide, 
hydroxamic acid, imidazoyl, ureido, thioureido, thiocyanato, hydroxy, C^C^alkoxy, 

mercapto, sulfonamido, het, phenoxy, phenyl, benzyl, benzyloxy, benzamido, tosyl, 
morpholino, morpholinyl, piperazinyl, piperidinyl, pyrrolinyl. imidazolyl and indolyl; 

20 R a is selected from the group of CQ-C^alkyl-Q-CQ-C^lkyl, C 0 -C 10 alkenyl-Q- 

C Q -C 6 alkyl, C Q -C 10 alkynyl-Q-C 0 -C 6 alkyl, Cg-Cj jCycloalkyl-Q-CQ-Cgalkyl, Cy 

C 1() cycloalkenyl-Q-C 0 -C 6 alkyl, Cj-Cgalkyl-Cg-C^aryl-Q-CQ-C^lkyl, C 6 -C 1() aryl- 

Cj-Cgalkyl-Q-CQ-Cgalkyl, C 0 -C 6 aIkyl-het-Q-C 0 -C 6 alkyl, C 0 -C 6 alkyl-Q-het-C 0 - 

C 6 alkyl, het-C 0 -C 6 alkyl-Q-C 0 -C 6 alkyl, C 0 -C 6 alkyl-Q-C 6 -C 12 aryl and Q-Cj-Cgalky, 
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where any aryl or het is optionally substituted with 1-3 R d and any alkyh alkenyl or 

alkynyl is optionally substituted with 1-3 R a ; 
a a' 

R andR may join to form a 3-7 member homocyclic ring substituted with 1-3 

R a ; 

5 R c is selected from hydrogen and substituted or unsubstiruted; amino, O-Cj- 

C g alkyl, amino-tCj-Cgalkyl), amino-dHCj -Chalky 1), Cj-C^alkyl, C 2 -C 1() alkenyl, 
C 2 -C 1 0 alkynyl, CyC l jCycloalkyl, C 3 -C 1() cycloalkenyI, Cj-C^lkyl-C^C^aryl, C^- 
C 10 aryl " C r C 6 alky1 ' Cj-Cgalkyl-het, het-C^C^lkyl, C 6 -C 12 aryl and het, where the 

substituits on any alky!, alkenyl or alkynyl are 1-3 R a and the substituents on any aryl or 
10 het are 1-3 R d ; 

R d is selected from R h and R p ; 

R h is selected from the group OH, OCF 3 , OR c , SR m , halo(F, CI. Br, I), CN, 

isocyanate, N0 2 , CF 3 , C 0 -C 6 alkyl-NR n R n ', C 0 -C 6 alkyl-C(=O)-NR n R n ', C 0 -C 6 alkyl- 

C( = 0)-R a , Cj-Cgalkyl, Cj-Cgalkoxy, C 2 -C g alkenyl, C 2 -C g alkynyl, C 3 -C 6 cycloalkyl, 

15 c 3 - c 6 c y cl oalkenyl, -Chalky I-phenyl, phenyl-Cj-C^lkyl, C y - 

C 6 alkyloxycarbonylamino, Cj-Cgalkyloxycarbonyl- C 0 -C 6 alkyl, phenyI-C Q - 

C 6 alkyloxy, C^Cgalkyl-het, het-Cj-C^lkyl, SCyhet, 0-C 6 -C 12 aryl, S0 2 -C 6 - 

C 12 aryl, SO^Cj-Cgalkyl and het, where any alkyl, alkenyl or alkynyl may optionally 

be sustituted with 1-3 groups selected from OH, halo(F, CI, Br, I), nitro, amino and 
20 aminocarbonyl, where the substituents on any aryl or het are 1-2 hydroxy, halo(F, CI, 
Br, I), CF 3 , Cj-C^alkyl, Cj-C^alkoxy, nitro and amino; 

R m is selected from hydrogen, S-Cj -Chalky 1, C^CO-Cj -Chalky I, C(=0)- 
n n* 

NR R , Cj-Cgalkyl, halo(F, CI, Br, D-Cj-C^lkyl, benzyl and phenyl; 
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R n is selected from the group R c , OH, OCF 3> OR°, CN, isocyanate, NH- 

C( = 0)-0-R c , NH-C( = 0)-R c , NH-C( = 0)-NHR c , NH-S0 2 -R s , NH-S0 2 -NH-C(=0)- 

R c , NH-C(=0)-NH-S0 2 -R s , C( = 0)-0-R°, C( = 0)-R c , C( = 0)-NHR c , C( = 0)-NH- 

C( = 0)-0-R°, C( = 0)-NH-C( = 0)-R C , C( = 0)-NH-S0 2 -R S , C( = 0)-NH-S0 2 -NHR c , 

5 S0 2 -R s , S0 2 -0-R°, S0 2 -N(R c ) 2> S0 2 -NH-C( =0)-0-R°, S0 2 -NH-C( = 0)-0-R° and 

S0 2 -NH-C( = 0)-R c ; 

R° is selected from hydrogen and substituted or unsubstituted Cj-C^alkyl, Cq- 
C^alkyl-C^-CjQaryl, C } -Chalky lcarbonyl, C^-C^alkenyl, C 2 -Cgalkynyl, C^- 

CgCycloalkyl and benzoyl, where the substituits on any alkyl are 1-3 R a and the 

10 substituents on any aryl are 1 -3 RP; 

R p is selected from the group; OH, COOH, COH, NH 2 , C Q -C 6 alkyl, halo(F, 

CI. Br, I), CN, isocyanate, OR°, SR m , SOR°, N0 2> CF 3> R c , NR n R n ', N(R n )-C( = 0)- 

0-R°, N(R n )-C(=0)-R c , S0 2 -R s , C 0 -C 6 aJkyl-S0 2 -R s , C 0 -C 6 alkyl-SO 2 -NR n R n ', 

C( = 0)-R c . 0-C( = 0)-R c , C( = 0)-0-R° and C( = 0)-NR n R n ', where the substituits on 
15 any alkyl, alkenyl or alkynyl are 1-3 R a and the substituents on any aryl or het are 1-3 
R d ; 

R s is a substituted or unsubstituted group selected from; Cj-Cgalkyl, C 2 ~ 
C g alkenyl, C 2 -C g alkynyl, C 3 -C g cycIoalkyl, C 3 -C 6 cycloalkenyl, C Q -C 6 alkyl- C 6 - 
C 1() aryl, C 6 -C J0 aryl-C 0 -C 6 alkyl, C Q -C 6 alkyl-het and het-C Q -C 6 alkyl, where the 

20 substituits on any alkyl, alkenyl or alkynyl are 1-3 R a and the substituents on any aryl or 
het are 1-3 R d ; 
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het is any mono-, bi-, or tricyclic saturated, unsaturated, or aromatic ring where 
at least one ring is a 5-, 6- or 7-rnembered ring containing from one to four heteroatoms 
selected from the group nitrogen, oxygen, and sulfur, the 5-membered ring having from 
0 to 2 double bonds and the 6- or 7-membered ring having from 0 to 3 double bonds and 
5 where any carbon or sulfur atoms in the ring may optionally be oxidized, and where any 
nitrogen heteroatom may optionally be quaternized and where any ring may contain from 

0-3 R d ; 

U is an optionally substituted bivalent radical selected from the group; Cj- 
C 6 alkyl, C 0 -C 6 alkyl-Q, C 2 -C 6 alkenyl-Q, and C^-C^alkynyl-Q, where the substituits on 

10 any alkyl, alkenyl or alkynyl are 1-3 R a ; 

Q is absent or is selected from the group; -O-, -S(0) s -, -S0 2 -N(R n )-, -N(R n )-, - 

N(R n )-C( = 0)-, -N(R n )-C( = 0)-0-, -N(R n )-S0 2 -, -C(=0)-, -C( = 0)-0-, -het-,-C(=0)- 

N(R n )-, -PO(OR c )0- and -P(0)0-, where s is 0-2 and the heterocyclic ring is 
substituted with 0-3 R n ; 

15 v is absent or is an optionally substituted bivalent group selected from Cj- 

C 6 alkyl, C 3 -C g cycloalkyl, C 0 -C 6 alkyl-C 6 -C ](;) aryl, and C Q -C 6 aIky-het, where the 

substituits on any alkyl are 1-3 R a and the substituents on any aryl or het are 1-3 R d ; 

W is selected from the group; hydrogen, -OR°, -SR m , -NR n R n ', -NH-C( = 0)- 
O-R 0 , -NH-C(=0)-NR n R n ' ) -NH-C( = 0)-R c , -NH-S0 2 -R s , -NH-S0 2 -NR n R n ', -NH- 

20 S0 2 -NH-C(=0)-R c , -NH-C(=0)-NH-S0 2 -R s , -C(=0)-NH-C(=0)-0-R°, -C(=0)- 

NH-C(=0)-R c , -C( = 0)-NH-C( = 0)-NR n R n ', -C( = 0)-NH-S0 2 -R s , -C(=0)-NH-SC> 2 - 

NR n R n , -C( = S)-NR n R n ', -S0 2 -R s , -SCyO-R 0 , -S0 2 -NR n R n , -S0 2 -NH-C(=0)-0- 

R°, -S0 2 -NH-C( = 0)-NR n R n ', -S0 2 -NH-C(=0)-R c , -0-C( = 0)-NR n R n \ -0-C( = 0)- 

R c , -0-C(=0)-NH-C(=0)-R c , -0-C( = 0)-NH-S0 2 -R s and -0-S0 2 -R s ; 
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Optionally, TBF m -pan A together with TBF m -part B and TBF n -part C together 
with TBF n -part D may independently form Cycle 1 substituted with -B-(Cycle 2)-E. 

A drug lead precursor or intermediate of this invention is represented by C 0 -C 2 - 

alkyl-L'-L^L^L'-L^Co-Q-alkyl where L 1 through L 5 are defined above. Additional 
5 embodiments of the present invention will become evident to the ordinarily skilled 
artisan upon a review of the present specification. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a synthetic reaction wherein an aldehyde is reacted with O- 
methyl hydroxy lamine to produce an O-methyl oxime compound. 
10 Figure 2 shows a variety of organic aldehyde pharmacophore molecules which 

after conversion to )-methyl oximes were identified as being capable of inhibiting the 
interaction between CD4 and gp!20. 

Figure 3 shows chemistry useful for chemically coupling two aldehydes via an 
O,0'-diamino-alkanediol linker to produce both heterodimeric and homodimeric oxime 
15 compounds. 

Figure 4 shows a variety of aldehyde pharmacophores found to be highly 
efficient in dimeric form for inhibiting the interaction between CD4 and gpl20. 

Figure 5 shows an O-methyl oxime compound found to be particularly effective 
in dimeric form for inhibiting the interaction between CD4 and gpl20. 
20 Figure 6 shows a specific ligand having particularly high activity for inhibiting 

the interaction between CD4 and gp!20. 

Figure 7 shows a synthetic reaction wherein an aldehyde is reacted with a 
dimethylamine in the presence of support-bound triacetoxyborohydride to give rise to an 
N,N-dimethy lamine organic compound. 
25 Figure 8 shows a chemical synthesis sequence resulting in the production of a 

ligand of the present invention. 

Figure 9 shows a flow diagram for the fragment assembly sequence resulting in 
the production of a drug lead compound of the present invention. 
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DETAILE D DESCRIPTION OF THE PREFERRED EMBODIMENT 
I. General Description 

The present invention provides a rapid and efficient method for identifying small 
molecule candidate target binding fragments or ligands that are capable of binding with 
5 high affinity to target biological molecules (TBM) of interest. The compounds 
identified by the subject method find use, for example, as drug lead compounds for the 
development of novel therapeutic drugs. The subject method involves the assembly of a 
library of small organic candidate target binding fragments which are capable of being 
chemically cross-linked via a linker element. The library of organic candidate target 
10 binding fragments may be "pre-screened" in order to identify members of the library 
that are capable of binding to a target biological molecule. At least a portion of the 
small organic candidate target binding fragments identified during the "pre-screen" as 
being capable of binding to the target are then chemically cross-linked to one another in 
various combinations to provide a library of potential or candidate cross-linked target 
15 binding fragments for binding to the target biological molecule. The library of candidate 
cross-linked target binding fragments having potential to be high affinity antagonists (or 
agonists) are then screened to identify drug lead compounds that bind to the target 
biological molecule with high affinity. The step of "pre-screening" the library of small 
organic candidate target binding fragments to identify those that are capable of binding 
20 to the target allows one to limit the library of potential binding ligands to only those that 
are comprised of the most favorable organic candidate target binding fragment building 
blocks, thereby decreasing the required complexity of the library of potential ligands 
while increasing the probability of identifying a molecule exhibiting high binding affinity 
for the biological target molecule. 
25 ° ne embodiment of the present invention is directed to a method for identifying a 

drug lead compound that binds to a target biological molecule of interest. The subject 
method involves assembling a library of organic candidate target binding fragment(s) 
that are capable of being chemically cross-linked via a chemically compatible cross- 
linker to provide candidate cross-linked target binding fragment(s) for binding to the 
30 target biological molecule. In this regard, the phrase "assembling a library of organic 
candidate target binding fragments" is to be construed broadly and is intended to 
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encompass all means by which one may obtain a library comprising two or more 
organic compounds which include, for example, obtaining such compounds from a 
commercial or non-commercial source, synthesizing such compounds using standard 
chemical synthesis technology or combinatorial synthesis technology (see Gallop et al. 
5 (1994), supra, Gordon et al. (1994), supra, Czarnik and Ellman (1996), supra, 
Thompson and Ellman (1996), supra and Balkenhohl et al. (1996) supra) and obtaining 
such compounds as degradation products from larger precursor compounds, e.g. known 
therapeutic drugs, large chemical molecules, and the like. 

The candidate target binding fragments (CTBF) and are, for the most part, small 

10 water soluble organic molecules that have one or more chemically reactive 
functionalities, referred to as linkable (or linkage) functional groups (LFG) (or sites that 
may be readily converted to a chemically reactive functionality using standard 
technology (BLFG)) that provide a site for coupling to another compound or candidate 
target binding fragment via a chemically compatible cross-linker. Thus, the candidate 

15 target binding fragments of the present invention are capable of being chemically 
coupled to one another via a cross-linker to provide candidate cross-linked target binding 
fragments for binding to the target biological molecule, meaning that the candidate target 
binding fragment compounds have a reactive functionality, or a site that can be readily 
chemically converted to a reactive functionality, where a chemically compatible cross- 

20 linker may covalently attach thereto, thereby allowing multimerization of the candidate 
target binding fragments through the cross-linker. "Ligands", "candidate ligands' or 
"candidate cross-linked target binding fragments " for binding to a "target biological 
molecule" for purposes herein are compounds that are obtained from reacting two or 
more organic compounds, which may be the same or different, preferably different, with 

25 one or more cross-linker so as to produce a molecule comprising two or more target 
binding fragments and one or more cross-linker. Such ligands are referred to herein as 
candidate cross-linked target binding firagments(CXL-TBF). 

Candidate target binding fragments having the linkage functional group modified 
or blocked so that it contains substantially the same linking group as is found in the 

30 candidate cross-linked binding fragments are sometimes referred to as monomers. 
Monomers or monomeric compounds that find use in the present invention include, for 
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example, aldehydes, ketones, oximes, such as O-alkyl oximes, preferably O-methyl 
oximes, hydrazones, semicarbazones, carbazides, primary amines, secondary amines, 
such as N-methylamines, tertiary amines, such as N,N-dimethylamines, N-substituted 
hydrazines, hydrazides, alcohols, ethers, thiols, thioethers, thioesters, disulfides, 
5 carboxylic acids, esters, amides, ureas, carbamates, carbonates, ketals, thioketals, 
acetals, thioacetals, aryl halides, aryl sulfonates, alkyl halides, alkyl sulfonates, aromatic 
compounds, heterocyclic compounds, anilines, alkenes, alkynes, diols, amino alcohols, 
oxazolidines, oxazolines, thiazolidines, thiazolines, enamines, sulfonamides, epoxides, 
and aziridines, and the like, all of which have chemically reactive functionalities (or are 
10 directly prepared from precursor compounds that have chemically reactive 
functionalities) capable of linking, either directly or indirectly, to a cross-linker . In 
fact, virtually any small organic molecule that is capable of being chemically coupled to 
another small organic molecule may find use in the present invention with the proviso 
that it is sufficiently soluble in aqueous solutions to be tested for its ability to bind to a 

15 target biological molecule. 

The above described monomers or candidate target binding fragments will serve 
as the individual building blocks for candidate cross-linked binding fragments prepared 
therefrom. Candidate target binding fragments that find use herein will generally be less 
than about 2000 daltons in size, usually less than about 1500 daltons in size, more 

20 usually less than about 750 daltons in size, preferably less than about 500 daltons in size, 
often less than about 250 daltons in size and more often less than about 200 daltons in 
size, although organic molecules larger than 2000 daltons in size will also find use 
herein. Candidate target binding fragments that find use may be employed in the herein 
described method as originally obtained from a commercial or non-commercial source 

25 (for example, a large number of small organic chemical compounds that serve as 
candidate target binding fragments are readily obtainable from commercial suppliers 
such as Aldrich Chemical Co., Milwaukee, WI and Sigma Chemical Co., St. Louis, 
MO) or may be obtained by chemical synthesis. Examples of the latter include the 
preparation of a library of organic oxime compounds from a single step condensation of 

30 commercially available aldehydes with O-alkyl hydroxylamine as described in Example I 
below and the preparation of a library of N,N-dimethylamine candidate cross-linked 
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target binding fragments from the reductive amination of commercially available 
aldehydes and dimethylamine using support-bound triacetoxyborohydride as described in 
Example II below and by Kaldor et al., Tetrahedron Leu. 37:7193-7196 (1996). 

Libraries of candidate target binding fragments or candidate cross-linked target 
5 binding fragments which find use herein will generally comprise at least 2 organic 
compounds, often at least about 25 different organic compounds, more often at least 
about 100 different organic compounds, usually at least about 500 different organic 
compounds, more usually at least about 1000 different organic compounds, preferably at 
least about 2500 different organic compounds, more preferably at least about 5000 
10 different organic compounds and most preferably at least about 10,000 or more different 
organic compounds. Libraries may be selected or constructed such that each individual 
molecule of the library may be spatially separated from the other molecules of the 
library (e.g., each member of the library is present in a separate microtiter well) or two 
or more members of the library may be combined if methods for deconvolution are 
15 readily available. The methods by which the library of organic compounds are prepared 
will not be critical to the invention. 

Once assembled, a library of organic candidate target binding fragments will be 
screened using one of any number of different known assays for the purpose of 
identifying candidate cross-linked target binding fragments that are capable of binding to 
20 a target biolocical molecule of interest. "Biological target molecules", "target biological 
molecules", "target biomolecules", "molecular targets", "biological targets", and other 
grammatical equivalents refer to target biological molecules (TBM) that are available 
(either commercially, recombinantly, synthetically or otherwise) in sufficient quantities 
for use in in vitro binding assays and for which there is some interest for identifying a 
25 high affinity binding partner. For the most part, target biological molecules are 
proteins, including human proteins or human pathogen proteins that may be associated 
with a human disease condition, such as cell surface and soluble receptor proteins, such 
as lymphocyte cell surface receptors, enzymes, such as proteases, clotting factors, 
serine/threonine kinases and dephosphorylases, tyrosine kinases and dephosphorylases, 
30 bacterial enzymes, fungal enzymes and viral enzymes, signal transduction molecules, 
transcription factors, proteins associated with DNA and/or RNA synthesis or 
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degradation, immunoglobulins, hormones, receptors for various cytokines including, for 
example, erythropoietin/EPO, granulocyte colony stimulating receptor, granulocyte 
macrophage colony stimulating receptor, thrombopoietin (TPO), 1L-1, IL-2, IL-3, IL-4, 
IL-5, IL-6, IL-10, IL-11, IL-12, growth hormone, prolactin, human placental lactogen 
5 (LPL), CNTF, octostatin, various chemokines and their receptors such as RANTES, 
MlPl-ct, IL-8, various ligands and receptors for tyrosine kinases such as insulin, insulin- 
like growth factor 1 (IGF-1), epidermal growth factor (EGF), heregulin-a and heregulin- 
p, vascular endothelial growth factor (VEGF), placental growth factor (PLGF), tissue 
growth factors (TGF-ct and TGF-P), other hormones and receptors such as bone 
10 morphogenic factors, folical stimulating hormone (FSH), and leutinizing hormone (LH), 
tissue necrosis factor (TNF), apoptosis factor-1 and -2 (AP-1 and AP-2), and proteins 
and receptors that share 20% or more sequence identity to these, and the like, nucleic 
acids, including both DNA and RNA, saccharide complexes, and the like. 

For the step(s) of screening libraries of candidate target binding fragments or 
15 candidate cross-linked target binding fragments for members having the ability to bind to 
a target biological molecule of interest, a simple ELISA assay may be used to (a) 
identify member(s) of the library that are capable of binding to the target, and (b) 
determine the approximate with which the library member(s) bind to the molecular 
target. While ELISA assays are preferred for screening libraries of organic compounds, 
20 virtually any in vitro assay that allows one to detect binding of the target biological 
molecule by an organic compound may be employed for screening the library, wherein 
such assays include ELISA, other sandwich-type binding assays, binding assays which 
employ labeled molecules such as radioactively or fluorescently labeled molecules, 
fluorescence depolarization, calorimetry, protein denaturation, resistance to proteolysis, 
25 gel filtration, equilibrium dialysis, surface plasmin resonance, X-ray crystallography, 
and the like. Such assays either measure the ability of library members to bind directly 
to the target biological molecule or are competition binding assays designed to measure 
the ability of library members to inhibit the interaction between the target biological 
molecule and another molecule that binds to the target biological molecule. Any of the 
30 above assays may be employed to screen libraries of candidate compounds to identify 
those that bind to a target biological molecule. 
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For the step of screening a library of candidate compounds to identify those that 
bind to a target biological molecule, it will be well within the skill level in the art to 
determine the concentration of the library members to be employed in the binding 
assay. For the most part, the screening assays will employ concentrations of candidate 
5 compounds ranging from about 0.01 to 10 mM, preferably from about 0.05 to 5 mM. 
The step of pre-screening a library of candidate target binding fragments to identify 
those that bind to a target biological molecule allows one to identify and isolate only 
those members of the library that have some binding affinity for the target. As such, in 
contrast to standard combinatorial library approaches, the small organic building blocks 
0 are "pre-screened" to select a smaller set of compounds that have some binding affinity 
for the target. Thus, the most productive organic compound building blocks can be 
identified for incorporation into the potential high affinity candidate cross-linked target 
binding fragments that are prepared therefrom, without having to screen all possible 
combinations of all of the initial candidate target binding fragment building blocks. 
Generally, the candidate target binding fragment library members selected as building 
blocks for subsequently prepared candidate cross-linked target binding fragments are 
those that have the highest affinity for binding to the target biological molecule. For the 
most part, candidate target binding fragments chosen as building blocks for 
incorporation into the subsequently prepared candidate cross-linked target binding 
fragments are those that bind to the target biological molecule with a of about lOmM 
or less, usually about 5mM or less, more usually about ImM or less, preferably about 
500 nM or less, more preferably about 100 jaM or less and most preferably about 50 jiM 
or less. However, for some applications, one or more of the candidate target binding 
fragment(s) chosen for incorporation into the subsequently prepared candidate cross- 
linked target binding fragments may have an individual for the target biological 
molecule of greater than 10 mM. 

Once candidate target binding fragments that bind to a target biological molecule 
with some desired degree of affinity are identified, at least a portion of those compounds 
(or structurally related analogs thereof) are chemically coupled via a cross-linker to 
provide a library of candidate cross-linked target binding fragments for binding to the 
target biological molecule, wherein those candidate cross-linked target binding fragments 
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comprise at least one candidate target binding fragment reacted or linked with a cross- 
linker. Usually two or more candidate target binding fragments (or structurally related 
analogs thereof) linked by a cross-linker are combined and in some cases these two 
fragments are the same. The two (or more) candidate target binding fragments (or 
5 structurally related analogs thereof ) incorporated into a candidate cross-linked target 
binding fragment may be the same (i.e., to provide a homodimer or homomultimer) or 
different (i.e., to provide a heterodimer or heteromultimer). Most commonly, the two 
candidate target binding fragments in the candidate cross-linked target binding fragment 
are different. 

10 By "structurally related analog", "analog" and the like, is meant a fragment 

compound that has the same chemical structure as a fragment identified as being capable 
of binding to the target biological molecule except that the analog has a different 
chemically reactive functionality or linkage functional group(LFG) for binding to the 
cross-linker than does the fragment that was identified as being capable of binding to the 

15 target biological molecule in the first or pre-screen. The analog may also optionally 
possess or lack one or more substituents that are either lacking or present, respectively, 
on the fragments identified in the pre-screening provided that the presence or absence of 
those substituents does not substantially alter the compounds ability to bind to the target. 
As such, while one may pre-screen a library of, for example, candidate oxime 

20 compounds to identify candidate oxime fragments that bind to the target biological 
molecule, one can chemically couple not the actual oxime compounds identified in the 
"pre-screening" but rather aldehydes having that same chemical structures as the oximes 
identified in the screen (but which have an aldehyde reactive functionality rather than an 
oxime reactive functionality). The present invention, therefore, encompasses not only 

25 chemical coupling of the actual compounds identified in the initial "pre-screening step" 
(e.g., aldehydes are pre-screened and also subsequently linked), but also the chemical 
coupling of structurally related analogs of those compounds (e.g., oximes are pre- 
screened but the analogous aldehydes are linked). 

As described above, candidate target binding fragment will comprise a 

30 chemically reactive functionality or linkage functional group (LFG) (or a site that can be 
converted to a chemically reactive functionality (BLFG)) to which a cross-linker may be 
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covalently bound, thereby providing a means for cross-linking two or more candidate 
target binding fragments having a LFG ( or analog or blocked form thereof) to provide 
a candidate cross-linked target binding fragment. Therefore, cross-linkers that find use 
herein will be multifunctional, preferably Afunctional, cross-linking molecules that can 
5 function to covalently bond at least two fragment compounds together via their reactive 
functionalities or LFG's. Linkers or cross-linkers (XL) will have at least one, two or 
more and preferably only two, chemically cross-reactive functional groups (CFG) on the 
cross-linker (XL). The chemically cross- reactive functional groups that are available for 
bonding to two or more candidate target binding fragments, wherein those functional 

10 groups may appear anywhere on the cross-linker, preferably at each end of the cross- 
linker and wherein those chemically cross-reactive functional groups may be the same or 
different depending upon whether the candidate target binding fragment is to be linked 
have the same or different chemically cross-reactive functional groups. Cross-linkers 
that find use herein may be substituted or unsubstituted straight-chain or branched alkyl, 

15 aryl, alkaryl, heteroaryl, heterocycle and the like. Preferably straight chain alkyl will 
generally be at least one methylene in length, more generally from 2 to 8 methylenes in 
length, and optionally as many as about 12 or more methylenes or the equivalent in 
length. Cross-linkers may include atoms or groups to increase or improve solubility of 
the library members, such as oxygen atoms interspersed between methylene groups 

20 creating ether or polyether linkers. Cross-linkers will generally comprise alkyl groups 
either saturated or unsaturated, and therefore, may comprise alkanes, alkenes or alkynes. 
Heteroatoms including nitrogen, sulfur, oxygen, and the like may also be appended to 
the alkyl to form groups such as; alkoxyl, hydroxy alkyl or hydroxy groups. Other 
cross-linking groups such as aryl, especially phenylene or substituted phenylene linkers 

25 are suitably employed. Usually cross-linker elements will be of varying lengths, 
thereby providing a means for optimizing the binding properties of a cross-linked target 
binding fragment prepared therefrom. 

In particularly preferred embodiments, cross-linkers may be 0,0'-diamino- 
alkanediol compounds, preferably O.O'-diamino-Ci-Qalkanediol, which are useful for 

30 chemically coupling aldehyde organic compounds, or any of a variety of different 
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diamine compounds, which are useful for chemically coupling aldehyde containing 
compounds. 

Various chemistries may be employed for chemically coupling candidate target 
binding fragments via a cross-linker to provide candidate cross-linked target binding 
5 fragments for binding to a target biological molecule. For example, many well known 
chemistries that can be employed for chemically coupling candidate target binding 
fragments(s) via a linker to form candidate cross-linked target binding fragments 
include, for example, reductive animations between aldehydes and ketones and amines 
(March, Advanced Organic Chemistry, John Wiley & Sons, New York, 4th edition, 
10 1992, pp. 898-900), alternative methods for preparing amines (March et al., supra, 
p. 1276), reactions between aldehydes and ketones and hydrazine derivatives to give 
hydrazones and hydrazone derivatives such as semicarbazones (March et al., supra, 
pp. 904-906), amide bond formation (March et al., supra, p. 1275), formation of ureas 
(March et al., supra, p. 1299), formation of thiocarbamates (March et al., supra, p. 892), 

15 formation of carbamates (March et al., supra, p. 1280), formation of sulfonamides 
(March et al., supra, p. 1296), formation of thioethers (March et al., supra, p. 1297), 
formation of disulfides (March et al., supra, p. 1284), formation of ethers (March et al., 
supra, p. 1285), formation of esters (March et al., supra, p. 1281), additions to epoxides 
(March et al., supra, p. 368), additions to aziridines (March et al., supra, p. 368), 

20 formation of acetals and ketals (March et al., supra, p. 1269), formation of carbonates 
(March et al., supra, p. 392), formation of enamines (March et al., supra, p. 1284), 
metathesis of alkenes (March et al., supra, pp. 1 146-1 148 and Grubbs et al., Acc. Chem. 
Res. 28:446-452 (1995)), transition metal -catalyzed couplings of aryl halides and 
sulfonates with alkenes and acetylenes (e.g., Heck reactions) (March et al., supra, 

25 pp.717-178), the reaction of aryl halides and sulfonates with organometallic reagents 
(March et al., supra, p. 662), such as organoboron (Miyaura et al., Chem. Rev., 95:2457 
(1995)), organotin, and organozinc reagents, formation of oxazolidines (Ede et al., 
Tetrahedron Letts. 38:7119-7122 (1997)), formation of thiazolidines (Patek et al.. 
Tetrahedron Letts. 36:2227-2230 (1995)), amines linked through amidine groups by 

30 coupling amines through imidoesters (Davies et al., Canadian J. Biochem. 50:416-422 
(1972)), and the like. 
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The step of chemically cross-linking, via a cross-linker, at least a portion of the 
candidate target binding fragments identified as described above as being capable of 
binding to the target biological molecule or structurally related analogs thereof provides 
a library of candidate cross-linked target binding fragments for binding to the target 
5 molecule that comprise at least two of the candidate target binding fragments or analogs 
thereof and the cross-linker. As previously stated the candidate target binding fragments 
incorporated into candidate cross-linked target binding fragments may be the same, 
thereby providing a homomultimer, or different, thereby providing a heteromultimer, 
and libraries of candidate cross-linked target binding fragments generally comprise both 

10 homo- and hetero-multimers. Candidate cross-linked target binding fragments for 
binding to the target molecule are preferably dimeric, however, candidate cross-linked 
target binding fragments that find use may also be trimeric, tetrameric, and the like, 
those compounds being obtained by employing cross-linkers having more than two 
chemically cross-reactive functional groups for cross-linking purposes. Candidate cross- 

15 linked target binding fragments for binding to a target biological molecule that find use 
herein will generally be less than about 1000 daltons in size and often less than about 750 
daltons in size. 

Libraries of candidate cross-linked target binding fragments for binding to the 
target biological molecule will generally comprise at least 1 candidate cross-linked target 

20 binding fragment, usually at least about 20 different candidates, more usually at least 
about 100 different candidates, preferably at least about 200 different candidates, more 
preferably at least about 500 different candidates, most preferably at least 1,000 different 
candidates and often 10,000 or more. Libraries of candidate cross-linked target binding 
fragments may be constructed such that each individual molecule of the library may be 

25 spatially separated from the other molecules of the library (e.g., each member of the 
library is in a separate microtiter well) or two or more members of the library may be 
physically combined if methods for decon volution are readily available. 

Once obtained, libraries of candidate cross-linked target binding fragments for 
binding to the target biological molecule will be screened for the purpose of identifying 

30 a member(s) of the library that is/are capable of binding to the target biological molecule 
with high affinity. For such purposes, any of the above described screening assays can 
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be employed, wherein preferably a biological assay such as an ELISA assay is 
employed. 

For the step of screening a library of candidate cross-linked target binding 
fragments to identify one or more that bind to a target biological molecule, it will be 
5 well within the skill level in the art to determine the concentration of the compounds to 
be employed in the binding assay. We have herein found that candidate cross-linked 
target binding fragments generated by chemically coupling organic compounds that bind 
to a target biological molecule often exhibit surprisingly high binding affinities for the 
target. For the most part, candidate cross-linked target binding fragments that serve as 

10 potential drug lead compounds or may have therapeutic efficacy on their own bind to the 
target biological molecules with a K^, of about 500 nM or less, usually about 100 nM or 
less, more usually about 50 nM or less. However, for various applications, one or 
more drug lead compound(s) having an individual K d for the target biological molecule 
of greater than 500 nM may also find use. 

15 Another embodiment of the present invention is directed to a method for 

inhibiting the interaction between first and second biological molecules which bind to 
each other, wherein the method comprises contacting a system comprising those 
molecules with a binding inhibitory amount of a candidate cross-linked target binding 
fragment or drug lead compound identified by the method described above, wherein the 

20 drug lead compound or cross-linked target binding fragment binds to the first biological 
molecule and inhibits its ability to bind to the second biological molecule. For the most 
part, the first and second biological molecules will be proteins, nucleic acids, saccharide 
complexes, and the like, preferably at least one being a protein, more preferably both 
being proteins. In particularly preferred embodiments, the first or second biological 

25 molecule may be CD4 or gpl20. In other preferred embodiments, the first biological 
molecule may be a protein wherein the second biological molecule is a receptor for that 
protein, a nucleic acid, either DNA or RNA, that binds to that protein or a 
polysaccharide or the first biological molecule is an enzyme wherein the second 
biological molecule is a substrate for that enzyme. 

30 Systems that comprise both the first and second biological molecules may be 

either in vitro or in vivo, wherein the first and second biological molecules are situated 
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such that they are capable of binding to one another. For in vivo applications, the lead 
of interest may be administered on its own or in pharmaceutical^ acceptable media, for 
example normal saline, PBS, etc. The additives may include bactericidal agents, 
stabilizers, buffers, or the like. In order to enhance the half-life of the drug lead 
5 compounds in vivo, the compounds may be encapsulated, introduced into the lumen of 
liposomes, prepared as colloids, or another conventional technique may be employed 
that provides for an extended lifetime thereof. 

The drug lead compounds may be administered as a combination therapy with 
other pharmacologically active agents or may physically linked to such agents or other 
10 carriers. Various methods for administration may be employed. The drug lead 
compounds may be given orally, or may be injected intra vascularly, subcutaneously, 
peritoneally, etc, A "binding inhibitory amount" of the drug lead compounds will vary 
widely, depending upon the nature of the first and second biological molecules, the 
frequency of administration, the manner of administration, the clearance of the 
15 compound from the host, and the like. Appropriate binding inhibitory amounts may be 
determined empirically by those skilled in the art in a routine manner, 
candidate cross-linked target binding fragments 

IL Definitions and preferred embodiments 

In its broadest embodiment, the method of this invention comprises: assembling a 
library of candidate target binding fragments; screening the library of candidate target 
binding fragments for those that bind to a target molecule; cross-linking target binding 
fragments to produce a library of candidate cross-linked target binding fragments; 
screening the library of candidate cross-linked target binding fragments for those that 
bind to the target molecule. The product of this method is referred to as a lead 
pharmaceutical or drug candidate. 

More specifically (see Figure 9), the method of this invention is used to identify 
lead pharmaceutical drug candidates and optionally involves the following simple steps 
(a-h). 
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(a) Assembling a library of candidate target binding fragments each 
fragment having a linkable (or linkage) functional group (LFG) or the blocked form 
thereof (BLFG), the blocked form containing linking group (LG); 



Assembling a library as used herein means any method of selecting two or more 
molecules to form a library for use in the method of this invention. Preferably the 
library will be large, greater than 50 members, and contain a diverse array of target 
interactive groups capable of forming non-covalent bonds, e.g. hydrogen, Van der 
Waals, electrostatic, hydrophobic and the like with the target molecule. Such interactive 
groups may include functional groups found on naturally occurring amino acid 
sidechains, carbohydrates, lipids, nucleic acids and their metabolites and derivatives 
thereof, or groups found on known pharmaceuticals. Optionally, a library may be 
customized to contain interactive groups known or suspected to interact with binding 
sites on the biological target molecule or its biological ligand. 

Candidate target binding fragments (CTBF) are small water soluble organic 
molecules having a molecular weight of about 200da (including the LFG) capable of 
forming a non-covalent complex with a target biological molecule (TBM). This complex 
may be of low affinity having a Kd as low as about 5mM. The CTBF's are 
commercially available or may be synthesized by known procedures. 

Linkable (or linkage) functional groups (LFG) include any functional groups 
capable of reacting with a chemically cross-reactive functional group (CFG) on a cross- 
linker (XL) thereby forming a stable covalent bond with the cross-linker. This covalent 
bond is referred to simply as a linking group (LG). When the linked molecule contains 
more than one linking group an integer following LG is used to indicate the number of 
the LG in the molecule. The LFG includes blocked, protected or otherwise transformed 
groups that may or may not react directly with the CFG on the cross-linker. This 
blocked form of the linkable functional group (BLFG) is often the preferred form of the 
CTBF because it may be less likely to form covalent linkages with the target biological 
molecule in the contacting step below. 
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In the case where the BLFG is to be used in the contacting step, the de-protected 
or re-transformed form of the CTBF that it is capable or reacting with the CFG on the 
cross-linker is the form used in the cross-linking step below. By way of illustration, a 
CTBF may contain an aldehyde as a LFG or this aldehyde may be protected or 
transformed by reacting it with an O-amino alcohol to form an oxime (BLFG) as shown 
below. 



In this case the oxime would be considered the BLFG. It will be appreciated that 
more than one reaction or transformation to an LFG may be made. By way of further 
illustration an aldehyde may be reacted with an amine to form a Schiff base, which in 
turn may be reduced to a secondary amine. This may be still further reacted to form an 
amide, sulfonamide, urea, carbamate etc. All these transformations of the initial 
aldehyde are also considered BLFG's. 

In the case where the CTBF (including the LFG) is first transformed or 
protected, the initial CTBF is sometimes referred to as a "precursor" while the 
transformed or protected form that is contacted with the TBM is referred to as a 
"monomer". Thus in the reaction illustrated above, the aldehyde may be called a 
precursor while the oxime is referred to as a monomer. In this case the oxime covalent 
bond ( = N-0-) is referred to as the linking group (LG). It is often preferred that the 
monomer contain the same linking group (LG) as is present in the cross-linked form, 
described below, because some of the binding energy with the target may come from 



Examples of linkage or linkable functional groups include; aldehyde, ketone, 
primary amine, secondary amine, epoxide, carboxylic acid, sulfonic acid, alcohol 
(hydroxyl), isicyanate, isothiocyanate, halide and sulfonate. These fuctional groups may 
act as precursors of the blocked linkage functional groups. 




JD-R 2 




LG. 
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Examples of CTBF's having blocked linkage or linkable functional groups are 
molecules containing Linkage Groups selected from; oxime, hydrazone, N-acyl 
hydrazone, secondary amine, tertiary amine, acetal, ketal, 1,2 amino alcohols, amide, 
N,N-disubstituted amides, thioamide, ureido, thioureido, carbamate, thiocarbamate, 
5 thiothiocarbamate, sulfonamide, carbamate, guanidino, amidino, thioester, ester, ether, 
2-hydroxyether, 2-hydroxythioether, thioether, disulfide, alkane (alkylene), alkene 
(alkenylene) and alkyne (alkynylene). Prefered monomers will contain the above 
functional groups as LG's. 

Steps b-d below may be combined into a single screening step that may be 
10 referred to as a first screening step, a pre-selection step or a pre-screening step. 

(b) Contacting the candidate target binding fragments with a 
target biological molecule (TBM). 

Contacting of the TBM with one or more members of the library may be 
conducted either individually or multiply. Preferably each candidate target binding 

15 fragment is contacted individually with the TBM. For example, this may conveniently 
be done in a 96 well format plate so that the formation of a complex with each member 
of the library can be conveniently evaluated without requiring any deconvolution step. 
The contacting step is often conducted at relatively high concentrations of the CTBF, so 
that Kd's as low as about 5 mM can be measured (see below). 

20 The Target Biological Molecule (TBM) may be any biological molecule 

preferably of mammalian and most preferably of human origin. Optionally preferred 
TBM's may be human pathogen proteins such as viral proteins from viruses that infect 
human cells. Preferred TBM's are proteins most preferably secreted proteins. Preferred 
secreted proteins include; enzymes, cytokines, hormones, growth factors and their 

25 receptors. The TBM's may be isolated from natural sources or made recombinately in a 
host cell. Normally the atoms of the TBM will contain the natural abundance isotopes, 
but in some circumstances may be enriched. When the target is a receptor or a cell 
surface bound molecule, the TBM may conveniently be the extracellular domain or a 
derivative thereof. 
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In a preferred embodiment of the method, the Target Biological Molecule is not a 
single biological molecule; rather it is two or more molecules. For example, preferred 
TBM's may be protein-protein, protein-DNA/RNA, protein-substrate pairs. By way of 
illustration, a ligand-receptor pair may be the actual target when an ELISA or other 
5 biological assay requiring two or more biological molecules is used to measure a 
physical association (see below) such as binding or activity. The binding constant, IC50 
or other measurementmay result from the CTBF binding with either the Ligand, receptor 
or both. Similarly, when selected fragments are cross-linked (see below) and re- 
screened against the ligand-receptor pair, the recombined fragments may bind with the 
10 ligand, receptor or both. 

(c) Measuring a change in a first physical association (PA-1) of the target 
biological molecule. 

Measuring a change as used herein means any method capable of quantifying a 
physical association, including binding, of the CTBF with the TBM. 

15 A physical association (PA-1) of the target molecule includes a biological 

property such as binding with another biological molecule, signal transduction or 
catalysis of a reaction. It may also include any measurable physical chemical property 
such as a spectroscopic or magnetic property. Preferably the physical association will be 
suitable for rapid high throughput screening. The most preferred physical association 

20 measurement will be a biological one such as measuring binding or catalysis. An 
example of binding measurement would be an ELISA assay where protein-protein 
antagonism is measured. 

(d) Selecting target binding fragments (TBF) based on (c). 

Selecting target binding fragments (TBF) is based on the physical association 
25 measurement step. Selected TBF's will include those that bind relatively weakly with 
the TBM. Thus, for example in a binding assay such as ELISA, fragments that bind 
with no greater than a 5 mM affinity may be selected for. Most commonly the first 
selected CTBF's wiirbind to the target with an affinity of from 2 mM to 100 ^M. 
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TBF's or monomers that bind with a higher affinity e.g. Kd < 50 jaM are preferred, 
however such relatively high binding affinities are not necessary for selection for the 
cross-linking step. 

(e) Reacting target binding fragments with a cross-linker, having 
5 chemically compatible cross- reactive group(s) with the LFG, under conditions 
suitable for forming a library of candidate cross-linked target binding fragments 
(CXL-TBF). 



The selected target binding fragments, which individually normally bind to the 
target with relatively low affinity, are then cross-linked, normally in all combinations 
10 and permutations, to produce a library of candidate cross-linked target binding fragments 
(CXL-TBF). By way or illustration, If TBF m and TBF n are selected from the first 
screen, these molecules are then reacted with a suitable cross-linker such as a 
Afunctional linker (BFL) to form candidate cross-linked target binding fragments (CXL- 
TBF) for the second screening step according to the general equation: 



15 



TBF m 


— LFG + 


TBF n 


—LFG + 


BFL 



The BFL above will normally have two cross-reactive groups that are compatible 
with both LFG's, that is they will form stable covalent bond(s) (LG) with the LFG's 
under selected reaction conditions. 

Each TBF may consist of two (or three) pans or moieties when LFG contains an 

20 atom capable of forming two (or three) bonds other than the linking group (LG) bond 
formed with the cross-linker. For example, when LFG is the carbonyl of a ketone, the 
carbonyl carbon may be bonded to two alkyl, aryl etc parts or moieties (part A and part 
B or part C and part D). This case may be represented generally by the diagram below 
where, for example, two ketones are cross-linked with an 0,0'-diamino-alkanediol 

25 cross-linker to form a di-oxime: 
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TBF m -part A 



TBF n -part C 



LG3 — XL 



TBF^part B 



TBF n -part D 



Here each of the two alkyl, aryl etc parts or moieties from the ketone are bonded to a 
nitrogen atom in the linking group (LG, =N-0-). 

The CXL-TBF's are sometimes referred to as "dimers", however, they are truly 
5 only dimers when the cross-linking moiety (XL) is only a chemical bond. In the most 
general case these "dimers" will contain a linking moiety, such as an alkane, alkene, 
arylene, alkyl ether and the like bonded through one or more linking groups (LG) to the 
corresponding TMF's. 

Reacting means chemically reacting so that a stable covalent bond or linking 
10 group LG is formed as the reaction product between the linkable functional group on the 
target binding fragment(s) and a chemically cross-reactive functional group (CFG) on the 
cross-linker. This step may be referred to as a cross-linking step or sometimes a 
"combination" or "recombination" step. 

Compatible functional group herein means capable of reacting with to form a 
15 stable LG. 

The cross-linker is a small organic molecule having at least one cross-reactive 
functional group capable of reacting with the linkable functional group of at least one of 
the target binding fragments. Commonly, the cross-linker will have 2-4 such cross- 
reactive functional groups and most commonly 2 such groups. In this case the cross- 

20 linker is referred to as a Afunctional linker (BFL), either homo-bifunctional or hetero- 
Afunctional depending on whether the compatible functional groups are the same or 
different from one another. 

In the case where the cross-linker has a single cross-reactive functional group, 
the candidate cross-linked target binding fragment, CXL-TBF, is simply the reaction 

25 product of these two molecules: 

LFG-, + 



TBF„ 



XLp — CFG 1 



CTBF 



LG-XL (or simply CXL-TBF) 
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20 



A simple example of such a reaction would be the product of an aldehyde and an 
amine. In this exemplary case Rl is a TBF and R2 Is XL where LFG-1 is an aldehyde 
and CFG-1 is a primary amine: 



R 



R, 



H 



-R, 



H 



In this case LG is the resulting secondary amine, -(NH)-. 

In the case where the cross-linker is a bifunctional linker, BFL, the linker has 
two cross-reactive linking groups and may be represented generally as: 



CFGh 



XL 



CFG, 



In this case, reacting two target binding fragments from the first screen above, each with 
its own linking functional group with the above BFL may be represented: 

+ CFG 1 1 XL [ — CFG 2 



TBF„ 



-LFG, 



TBF. 



—LFG, 



TBF n 



-LGt— | XL I — LGt— 



TBF n 



In the most general case of the above reaction, the two target binding fragments 
will have different linkable functional groups and will be linked by a hetero-bifunctional 
linker (het-BFL). In the most common case, both TBF will have the same CFG and will 
be linked by a homo-bifunctional linker (homo-BFL). In some cases, target binding 
fragments having different linkable functional groups may be linked with a homo-BFL 
and target binding fragments having the same linkable functional groups may be linked 
with a het-BFL. 

Higher order linking groups such as tri- and tetra-functional linkers may also be 
employed in the method of this invention. By way of illustration a trifunctional linker 
such as that shown below may be used: 



AX 



R— NH 2 



R 2 ~ NH 2 



R 3 — NH 2 



R 3 — N 
3 H 



^1 

HN 

XX 



IJJH 
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In each of the above cases, the product of the cross-linking reaction produces a candidate 
cross-linked target binding fragment(s) (CXL-TBF). These candidate molecules, 
typically with a molecular weight of 400 - 600 daltons, are then screened a second time 
against the target biological molecule. 
5 Steps f-h below may be combined into a single screening step that may be 

referred to as a second screening step, a selection step or a final screening step. 

(0 Contacting the candidate cross-linked target binding fragments with 
the target biological molecule. 

Contacting of the TBM and one or more members of the library of candidate 
10 cross-linked target binding fragments (CXL-TBF) may be conducted either individually 
or multiply as before. Preferably each candidate cross-linked target binding fragment is 
contacted individually with the TBM. The contacting step is usually conducted at lower 
concentrations of the CXL-TBF compared to CTBF's in the first screening step, so that 
Kd's in the micromolar or nanomolar range can be measured. The contacting format 
15 may be the same or different from that in the first or pre-screening step. Thus for 
example the two contacting steps may both be part of a binding assay (e.g. ELISA) or, 
for example, the second contacting step may be a functional activity assay or cell based 
binding assay. 

(g) measuring a change in a second physical association (PA-2) of the 
20 target biological molecule. 

The second physical association (PA-2) measurement for the CXL-TBF' s may be 
the same as that used for the CTBF's or may be a different physical measurement. 
Preferably the second physical association measurement will be a biological 
measurement rather than a physical chemical measurement such as spectroscopic or the 
25 like. 
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selecting cross-linked target binding fragments (XL-TBF) based of (g). 



The cross-linked target binding fragments (XL-TBF) selected will typically have 
a Kd, IC-50 or the equivalent of 500 nM or better. These XL-TBF's are useful as drug 
lead pharmaceutical candidate molecules. 

5 III Specific Chemistry 



Many chemistries may be employed to produce linkable functional groups or to 
block or derivitize them. Similarly a wide array or linking chemistries are possible. 
Described below are a number of chemistries suitable for rapid high throughput 
screening. This description is meant to be illustrative and not limiting. In the 
10 description below the term "CTBF" is equivalent to "CTBF-part A" plus " CTBF-part 
B" defined above. Similarly, Linkable functional group is used interchangeably with 
linkage functional group. 

The term "alkyl" means a cyclic, branched or unbranched saturated or 
unsaturated hydrocarbon radical having the number of carbons specified, or if no 
15 number is specified, up to 12 carbon atoms. 

The term "aryl" means a homocyclic aromatic hydrocarbon radical having from 
6-14 carbon atoms. Examples include phenyl, napthyl, biphenyl, phenanthrenyl, 
napthacenyl and the like. 

The term "heteroaryl" means a heterocyclic aromatic radical having from 4-13 
20 carbon atoms and from 1-6 heteroatoms selected from N, O, S and P. 

The term "heterocycle" means a saturated or partially unsaturated cyclic radical 
having from 3-13 carbon atoms and from 1-6 heteroatoms selected from O, S, N and P. 

The term "alkoxy" means an alkyl radical, as defined above, substituted with an 
oxo radical ( O-). 

25 The term "acyl" means alkanoyl or alkylcarbonyl having from 1-12 carbon 

atoms. 

The term "carboxy ester" means acyloxy or alkanoyloxy having from 1-12 
carbon atoms. 
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The term "carboxamide" means alkylcarbonylamino having from 1-12 carbon 

atoms. 

GENERAL NOTE: When CTBFs are employed with BLFGs, they may be 
prepared as described below from CTBFs containing the corresponding LFGs. 
5 Alternatively, it may be practical to purchase, otherwise acquire, or prepare by known 
methods the CTBFs with BLFGs using alternative chemistry. 

Candidate Target Binding Fragments or Molecules (CTBFs) 

1. Aldehyde and ketone as the linkage functional groups and the corresponding 
blocked linkage functional groups 
10 (a). The linkage functional group is the carbonyl group present in an aldehyde 

or a ketone. These CTBFs may be available commercially, or may be prepared by a 
variety of known methods to those practiced in the art. 
Aldehyde CTBFs are represented as follows. 



CTBF — CHO 



15 Ketone CTBFs are represented with two different parts (A and B) of the CTBF attached 
to ketone carbonyl as shown below. 



CTBF-part A 








CTBF-pa 


rt B 



(b). Alternatively, the library is assembled with the oxime group as the 
blocked linkage functional group (BLFG). These CTBF's are prepared by condensation 

20 of aldehydes or ketones LFG's with an O-substituted hydroxylamine. R 8 may be H, or 
a straight chain or branched alkyl group of length 1 to 10, which may incorporate from 1 
to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to 
five R 9 groups (R 9 is alkyl ? aryl, heteroaryl, heterocycle, carboxy ester, carboxamide, 
amino, jY-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also 

25 be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, 
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carboxy ester, carboxamide, amino, /V-acylamino, alkoxy, hydroxy, mercapto, 
phosphono). 

.OR 8 



CTBF 



-CHO 



H 2 N-OR 8 



N 

CTBF I— CH 



or 



CTBF-part A 



H 2 N-OR 8 



CTBF-part A 



N-OR 8 



CTBF-part B 



CTBF-part B 



(c). Alternatively, the library is assembled with the hydrazone group as the 
BLFG. These CTBFs are prepared by condensation of aldehydes or ketones LFGs with 
an /V-substituted hydrazine( March, Advanced Organic Chemistry, John Wiley & Sons, 
New York, 4 th edition, 1992, pp. 904-906). R 8 may be H, or a straight chain or 
branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms 
(N, O, P, S) within the chain. R» may also be appended with up to five R 9 groups (R 9 
is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, 7V-acylamino, alkoxy, 
hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group 
that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, 
TV-acylamino, alkoxy, hydroxy, mercapto, phosphono). 

NHR 8 



CTBF —CHO 



H 2 N-NHR a 



CTBF 



N 
I! 

-CH 



or 



CTBF-part A 



H 2 N— NHR 8 



CTBF-part A 



N— NHR 8 



CTBF-part B 



CTBF-part B 



(d). Alternatively, the library is assembled with the N-acylhydrazone group as 
the BLFG. These CTBF's are prepared by condensation of aldehydes or ketone LFG's 
and an N-acyl hydrazine. Many reaction conditions are known to those practiced in the 
art (e.g., [March et al. supra, pp. 905-906] and [Li et al. Chem. Biol. y 1: 37 (1994)]). X 
may be nothing, O, S, NH or NR 9 R 8 or R 9 may be H, or a straight chain or branched 
alkyl group of length . 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, 
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P, S) within the chain. R 8 or R 9 may also be appended with up to five R 10 groups (R 10 
is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, 
hydroxy, mercapto, phosphono, sulphono). R 8 or R 9 may also be an aryl or heteroaryl 
group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF 



O 



— CHO + H 2 N n m A 



x R . 

CTBF \—CH O 



or 



10 



15 



20 



CTBF-part A 



CTBF-part B 



^XR 0 



CTBF-part A 



o 



CTBF-part B 



(e). Alternatively, the library is assembled with an amine group as the BLFG. 

These CTBF's are prepared by reductive amination of aldehydes or ketone LFGs. A 

large number of reducing agents could be employed that are known to those practiced in 

the art (March, et al. supra, pp. 898-900). R 8 or R 9 may be H, or a straight chain or 

branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms 

(N, O, P, S) within the chain. R 8 or R 9 may also be appended with up to five R 10 

groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, 

alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 or R 9 may also be an aryl or 

heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, 

carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 

r>IR 8 R 9 
-CH2 



CTBF —CHO + 



HNR 8 R 9 



reducing agent 



CTBF 



CTBF-part A 



or 



„ „ reducing agent 
^ + HNR 8 R 9 — 



CTBF-part B 



CTBF-part A 




— H 


J CTBF-p* 


irt B 



(f). Alternatively, the library is assembled with an acetal or ketal group as the 
BLFG. These CTBF's are prepared by condensing the aldehyde or ketone LFG's with a 
diol. Conditions for the preparation of acetals or ketals are known to those practiced in 
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the art (March, et al. supra, pp. 889-891). R 8 may be a straight chain or branched alkyl 
group of length 2 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) 
within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is alkyl, 
aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). R 8 may be a straight chain or 
branched alkyl group of length 2 to 10, which may incorporate from 1 to 10 heteroatoms 
(N, O, P, S) within the chain. R 8 may also be appended with up to five R 9 groups (R 9 
is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, 
hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group 
that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, 
N-acyiamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF 



-CHO + 



HO-R 8 -OH 



CTBF 



O-R 8 
i i 

-c-o 

H 



or 



CTBF-part A 



-r + HO-R 8 -OH 



CTBF-part A 



O-R 8 
-I— O 



CTBF-part B 



CTBF-part B 



(g). Alternatively, the library BLFG's may be prepared by condensing the 
aldehyde or ketone LFGS with an amino alcohol or an amino thiol. Methods to prepare 
the product oxazolidines and thiazolidines are known to those practiced in the art (e.g., 
oxazolidines: Ede, et al. Tetrahedron Letters, 38: 7119-7122 (1997), and thiazolidines: 
Patek et al. Tetrahedron Letters, 36:2227-2230 (1995)). R 8 or may be a straight chain 
or branched alkyl group of length 2 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five 
R 10 groups (R*° is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl 
or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). R 9 may 
be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
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incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 9 may also be 
appended with up to five R n groups (R 11 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, /V-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 
R 9 may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, /V-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). 



CTBF —CHO + 



HO—R 8 — NHR 9 



O-R 8 

CTBF |— C-NR 9 
H 



or 



CTBF-part A 


— , 


r + 




CTBF-p 


art B 



HO-R^NHR 9 



CTBF-part A 




— NR 9 




CTBF-p 


art B 



10 (h). Alternatively, the library is assembled with the alkene group as the 

blocked linkage functional group (BLFG). These CTBF's are prepared by condensation 
of aldehydes or ketone LFG's with phosphorous ylides (Maryanoff et al. Chemical 
Reviews, 89, 863-927 (1989). R 8 and R 9 may be H or a straight chain or branched alkyl 
group of length 2 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) 

15 within the chain. R 8 and R 9 may also be appended with up to five R 10 groups (R 10 is 
alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, 
hydroxy, mercapto, phosphono, sulphono). R 8 and R 9 may also be an aryl or heteroaryl 
group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 and R 9 may 

20 also be halogens and heteroatoms. 



CTBF— CHO + 



,R 8 



1 ^ 

ctbf| — ch 



C R 9 



CTBF-part A 



CTBF-part A 



CTBF-part A 



-R* 



CTBF-part A 
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(i). Alternatively, carbanions, usually stabilized carbanions, may also be 
added into the carbonyl. Either the alcohol product A is obtained, or the hydroxy 1 group 
is eliminated to provide an alkene B. Numerous methods are available for performing 
this transformation (March et al., supra, 937-950). ). R 8 and R 9 may be H or a straight 
chain or branched alkyl group of length 2 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 and R 9 may also be appended with up to 
five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 and R 9 may also be 
an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino N-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 and R 9 may also be halogens and heteroatoms. 



H / 

CTBF — CHO + C— — R 9 



OH 



base 



CTBF-CH^ / q 

1 R 9 



CTBF 



— CH 



15 



20 



CTBF-part A 



CTBF-part a] 



A 



base 



CTBF-part A 



HO v I 
\ / CH ~ F 



CTBF-part A 



CTBF-part A| 



CTBF-part A 



A B 

2. Primary and secondary amines as the linkage functional groups and the 
corresponding blocked linkage Junctional groups . 

(a). The linkage functional group is the basic nitrogen of primary or 
secondary amines. These CTBFs may be available commercially, or may be prepared 
by a variety of known methods to those practiced in the art. CTBF's that have a 
primary amine LFG's are represented as follows. 



CTBF — NH 



CTBF's that have a secondary amine LFG's are represented with two different parts (A 
and B) of the CTBF attached to the amine group as shown below. 
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CTBF-part A 



NH 



CTBF-part B 



(b). Alternatively, the library may be assembled as secondary or tertiary 
amine BLFGs. 

(i). CTBF's with secondary or tertiary amine BLFG's may be 
prepared by reductive amination of primary amine or secondary amine LFG's, 
respectively, with aldehydes and ketones. A large number of reducing agents could be 
employed that are known to those practiced in the art (March, et aL supra, pp. 898- 
900). R 8 or R 9 may be H, or a straight chain or branched alkyl group of length 1 to 
10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
or R 9 may also be appended with up to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono, sulphono). R 8 or R 9 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). Numerous reductive amination 
methods may be used that are known to those practiced in the art. 



CTBF — NH 



R 9 COR* 



Reducing Agent 



:hr 8 r 9 



Pharmacophore! — NH 



CTBF-part A 



-NH + R^OR 8 



or 



Reducing Agent 



CHR°R 



8 D 9 



CTBF-part B 



CTBF-part A |— fjJ 



CTBF-part B 



(ii). Alternatively, CTBF's with secondary or tertiary amine BLFG's may be 
20 prepared by reaction of primary or secondary amine LFG's, respectively, with an aryl, 
heteroaryl or alkyl group substituted with a leaving group X, where X may be a halide 
or a sulfonate group (OSCbR where R is substituted or unsubstituted alkyl or aryl, e.g. 
CH3, CF3, phenyl-CH3 and phenyl-NC>2). The halide could be attached to aromatic or 
heteroaromatic functionality on R 8 , or it could be attached to an aliphatic group on R 8 . 
25 When X is substitute*} upon aromatic and heteroaromatic functionality, an S^Ar reaction 
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or a palladium-mediated (or related transtion metal mediated) amine coupling reaction 
would be performed [e.g, March, et al. supra, pp. 656-657; Wagaw et al., J. Am. 
Chem. Soc, 119: 8451-8458 (1997) and references therein; and Ahman et al. 
Tetrahedron Letters, 38: 6363-6366 (1997)]. Where X is substituted upon alkyl 
5 functionality, an Sn2 or SnI reaction would be performed (March, et al., supra, pp. 
411-413). R 8 may be H, or a straight chain or branched alkyl group of length 1 to 10, 
which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may 
also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 
10 R 8 may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). 



15 



CTBF I — NH 2 + X-R 8 



CTBF 



R 8 

— NH 



or 

R 8 



CTBF-part A — NH + X-R 8 



CTBF-part A — N 



CTBF-part B 



CTBF-part B 



(iii). Alternatively, CTBFs with secondary or tertiary amine BLFG's may be 
prepared by reaction of primary or secondary amine LFG's, respectively, with a 
substituted epoxide (March, et al., supra, p. 416). Many epoxides are available 
commercially. Alternatively they can be prepared by a number of known methods to 

20 those practiced in the art, most preferably by epoxidation of an alkene. R 8 to R 11 may 
be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 to R 11 may also 
be appended with up to five R*2 groups (R 12 is alkyl, aryl, halide, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, 

25 sulphono). R 8 to R 11 may also be an aryl or heteroaryl group that is optionally 
substituted (alkyl, aryl, halide, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). 
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CTBF 



11 



R 8 QH 



CTBF 




-NH 



,i 10 

,11 



or 



CTBF-part A | — fjJH 



♦ 2V 



CTBF-part A 



R 8 OH 



CTBF-part B 



I 



CTBF-part B 



(c). Alternatively, the library may be assembled as amide or thioamide BLFG's. 
5 These CTBFs may be prepared by coupling primary and secondary amine LFG's with 
carboxylic acids (X = H), carboxylic acid derivatives (X = OR, SR, halide), or the 
corressponding thiocarboxylic acid derivatives (X = OR, SR, halide). Numerous 
methods are also available for coupling carboxylic acids and carboxylic acid derivatives 
with amines and are known to those practiced in the art [e.g., March, et al., supra, pp. 

10 417-425; and Fields et al. Int. J. Peptide Protein Res, 35:181-187 (1990)]. R 8 may be 
H, or a straight chain or branched alkyl group of length 1 to 10, which may incorporate 
from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with 
up to five R 9 groups (R 9 is halide, alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino, W-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also 

15 be an aryl or heteroaryl group that is optionally substituted (halide, alkyl, aryl, halide, 
heteroaryl, carboxy ester, carboxamide, amino, Af-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). 



CTBF 



-NH> 



Z 

X^R 8 

z = o,s 



CTBF 



-NH 
Z = O.S 



20 



or 



CTBF-part A f— NH 



CTBF-part B 



A 



Z = O.S 



CTBF-part A — N 



CTBF-part B 



(d). Alternatively, the library may be assembled as urea or thiourea BLFG's. 
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10 



15 



20 



25 



(i). These CTBF's may be prepared by condensation of primary or 
secondary amine LFGs and isocyanates or isothiocyanates. The direct coupling of 
isocyanates and isothiocyanates with amines is known to those practiced in the art 
(March, et al., supra, p. 903), R 8 may be H, or a straight chain or branched alkyl 
group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) 
within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, /V-acylamino, alkoxy, hydroxy, 
mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF 



-NHo 



Z =0, S 



CTBF 



Z 

A 



z = o,s 



or 



CTBF-part A — NH 



CTBF-part B 



2=C=N-R 8 
Z = O, S 



CTBF-part A 



l. 



T 



H 



CTBF-part B 



(ii). Alternatively, these CTBF's may be prepared by a two step process. 
In the first step, the primary or secondary amine LFG's is converted to carbamate 
(thiocarbamates) or related derivatives where X and Y are alkoxy groups, mercaptyl 
groups, halides, or other suitable leaving groups. In the second step a primary or 
secondary amine is added (e.g., Hutchins, Tetrahedron Letters, 35: 4055-4058, (1994) 
and references therein). R 8 and R 9 may be H, or a straight chain or branched alkyl 
groups of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) 
within the chain. R 8 and R 9 may also be appended with up to five R 10 groups (R 10 is 
alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, /V-acylamino, alkoxy, 
hydroxy, mercapto, phosphono, sulphono). R 8 and R 9 may also be aryl or heteroaryl 
groups that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino /V-acylamino, alkoxy, hydroxy, mercapto, phosphono). 
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10 



(1) 



z 
JJ 

CTBF I NH 2 + X^Y 



CTBF 



(2) 


CTBF v 


or 


Z 


(1) CTBF-part A 


-NH + 

I 


CTBF-part B 




-A 

I 


(2) j CTBF-part A 



z = o, s 

2 

. N A Y + r 8 ^ nh 

H | 

z = o,s r9 



z 



CTBF 



z = o.s 



CTBF-part A 



CTBF-part B 



z = o, s 



* NH 



z 

x 

'N Y 
H 



Z 

X R 8 

H ' 9 
R 



Z 



CTBF-part B 



j CTBF-part A 



/ N 



CTBF-part B 



(iii). These CTBF's may also be prepared by condensation of primary or 
secondary amine LFG's and carbamates, thiocarbamates, or related derivatives where X 
is an alkoxy group, a mercaptyl group, or a halide (e.g., Hutchins, Tetrahedron Letters, 
35: 4055-4058, (1994) and references therein). R 8 and R 9 may be H, or a straight chain 
or branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 and R 9 may also be appended with up to 
five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 and R 9 may also be 
aryl or heteroaryl groups that are optionally substituted (alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino /V-acylarnino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF — NH 2 



Z 

x A n .r 8 

z = o,s 



CTBF 



R 9 



z = o.s 



15 



or 
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10 



15 



20 



25 



CTBF-part A \ — f|JH 



+ x^n' rS 



CTBF-part A — rjj 



.K* 



CTBF-part B 



z = o,s 



CTBF-part B 



(d). Alternatively, the library may be assembled as sulfonamide BLFG's. These 
CTBF's may be prepared by coupling primary and secondary amine LFG's with sulfonic 
acids (X = H) or sulfonic acid derivatives (X = OR, SR, halide). Numerous methods 
are available for coupling sulfonic acids and sulfonic acid derivatives with amines and 
are known to those practiced in the art (e.g., March, et al., supra, p. 499, Greene, et al. 
Protective Groups in Organic Synthesis, John Wiley & Sons, New York, 2 nd edition, 
1991, pp. 379-385). R 8 may be H, or a straight chain or branched alkyl group of length 
1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. 
R 8 may also be appended with up to five R 9 groups (R 9 is halide, alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino, yV-acylamino, alkoxy, hydroxy, mercapto, 
phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is optionally 
substituted (halide, alkyl, aryl, halide, heteroaryl, carboxy ester, carboxamide, amino, 
/V-acylamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF — NH 2 + 



o v o 

X^R 8 



CTBF 



H 



or 



CTBF-part A 



— NH 



X-^R 8 



CTBF-part A — N 



CTBF-part B 



CTBF-part B 



(e). Alternatively, the library may be assembled as carbamate, thiocarbamate or 
related BLFG's. These CTBF's may be prepared by condensation of primary and 
secondary amine pharmacophores with carbonyl derivatives (X = halide, OR, SR; Y = 
S,0; Z = S,0). Numerous methods are known to those practiced in the art (e.g., 
March, et al., supra, p. 418; and Greene, et al., supra, pp. 315-348). R 8 may be H, or 
a straight chain or branched alkyl group of length 1 to 10, which may incorporate from 1 
to 10 heteroatoms (N, O, P, S) within the chain.. R 8 may also be appended with up to 
five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
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acylamino, alkoxy, hydroxy, mercapto, phosphono, suiphono). R 8 may also be an aryl 
or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino /V-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulfono). 



CTBF 



— NH 2 + 



Z 



YR* 



CTBF 



Z 

A 



YR C 



Z = O.S; Y = S, O 



or 



CTBF— Part A 



— NH 



Z 

.A 



YR 8 



CTBF — Part A — N 



Z 

Ayr 8 



CTBF— Part B 



Z = 0,S; Y = S, O 



CTBF— Part B 



(0- Alternatively, the library may be assembled as guanidine BLFGs. These 
CTBFs may be prepared by condensation of primary and secondary amine 
pharmacophores with carbonyl derivatives (X = halide, OS02R, OR, SR) [Roskamp, et 
al. Tetrahedron, 53:6697-6705 (1997)]. R» may be H, or a straight chain or branched 
alkyl group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, 
P, S) within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is alkyl, 
aryl, heteroaryl, carboxy ester, carboxamide, amino, /V-acylamino, alkoxy, hydroxy, 
mercapto, phosphono, suiphono). R 8 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulfono). 

n NR 10 

CTBM I NH 2 + X^NR 8 R 9 ^ I CTBM 



NR 10 

u 

X^NR 8 R 9 



CTBM 



NR e R 



Bd9 



or 



CTBM-Fragment A — NH 



NR 



NR R 



i8 D 9 



NR 1 

A. 



CTBM-Fragment A Un^NR 8 R 9 



I 



CTB ft*— Fragment B 



CTBM-Fragment B 



3. Epoxides as the linkage Junctional groups and the corresponding blocked linkage 
Junctional groups. 
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(a). The linkage functional group is the epoxide group. These CTBFs may be 
available commercially, or may be prepared by a variety of known methods to those 
practiced in the art. 

Epoxide CTBFs are represented with four parts (A through D) of the CTBF attached to 
the epoxide group as shown below. Each fragment may be H, or functionality whereby 
a carbon atom is directly attached to the epoxide functionality. 



CTBF — Part A 



CTBF — Part B 



CTBF— Part C 



CTBf^-Part D 



(b). Alternatively, the library may be assembled as 1,2-amino alcohol BLFGs. 
These CTBFs may be prepared by coupling epoxide LFGs with primary or secondary 
amines employing known methods to those practiced in the art (March, et aL, supra, p. 
416).. R 8 or R 9 may be H, or a straight chain or branched alkyl group of length 1 to 
10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
or R 9 may also be appended with up to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, 
15 phosphono, sulphono). R 8 or R 9 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). 



| CTBF Part A K./\^H CTBF-Pari C """] [ CTBF-PartA~ 



CTBF-Part B \j , + NHR 8 R ! 



CTBF — Part D | | CTBF— Part B 




CTBF-Part C 



CTBF — Part D 



(c). Alternatively, the library may be assembled as, 2-hydroxy thioether BLFGs. 

20 These CTBFs may be prepared by coupling epoxide LFGs with thiols employing known 
methods to those practiced in the art (Wardell, in Patai The Chemistry of the Thiol 
Group, pt. 1; Wiley, New York, 1974, pp. 246-251). R 8 may be H, or a straight chain 
or branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five R 9 

25 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, 
alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl or 
heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). The 2- 
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20 



hydroxy thioether BLFGs may also be readily oxidized to more warer soluble 2-hydroxy 
sulfoxide or sulfone BLFGs (March, et al., supra, pp. 1201-1203). 



CTBF-Part A k^/\ ^ | CTBF Part C | | CTBF — Part A U ^\ CTBF— Part C [ 

CTBF Part B 1 + HSR * I CTBF-Part B 



CTBF-Part D I I v ' ,P ^ r '" 10 f R»S N CTBF — Part D 




(d). Alternatively, the library may be assembled as, 2-hydroxy ether BLFGs. 
5 These CTBFs may be prepared by coupling epoxide LFGs with alcohols employing 
known methods to those practiced in the art(March, et al., supra, p. 391). R 8 may be 
H, or a straight chain or branched alkyl group of length 1 to 10, which may incorporate 
from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with 
up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, 
10 N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono), R 8 may also be an 
aryl or heteroaryl group that is optionally substituted (alkyh aryl, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 



| CTBF— Part A KxV°V^| CTBF— Pan C | j CTBF— Pan A 




CTBF Part B K N rTR ^ D , rt „ 1 + HOR ' " \ CTBF-Part B 



CTBF — Part O I UDr ^ rcm ° V R» 0 ^ CTBF-Part D 




CTBF-Part C 



4. Carboxylic acids as the linkage functional groups and the corresponding blocked 
15 linkage functional groups 

(a). The linkage functional group is the carboxylic acid. These CTBFs may be 
available commercially, or may be prepared by a variety of known methods to those 
practiced in the art. 

Carboxylic acid CTBFs are represented as follows. 



CTBF — COOH 



(b). Alternatively, the library may be assembled as amide BLFGs. These CTBFs 
may be prepared by coupling carboxylic acid LFGs (X = OH), or derivatives of 
carboxylic acid LFGs (X = OR, SR, halide) with primary or secondary amines. 
Numerous known methods are available for coupling carboxylic acids and carboxylic 
25 acid derivatives with amines to those practiced in the art [e.g., March, et al., supra, pp. 
417-425: and Fields et al. Int. 7. Peptide Protein Res. 35:181-187 (1990)]. R 8 and R 9 
may be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 ta 10 heteroatoms (N, O, P, S) within the chain. R 8 and R 9 may 
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also be appended with up to five R 10 groups (R 10 is alkyl, aryl, heteroaryl. carboxy 
ester, carboxamide, amino, /V-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 and R 9 may also be an aryl or heteroaryl group that is optionally 
substituted (alkyi, aryl, halide, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). 



o o 

CTBF I — ^ + HN-R 8 I CTBF 



NR 8 R 9 



10 



(c). Alternatively, amine BLFGs could be prepared by reduction of amide 
BLFGs prepared as described above in step 4b from the corresponding carboxylic acid 
LFGs. A number of reducing agents could be employed that are known to those 
practiced in the art (e.g., March, et al., supra, pp. 1212-1213). R 8 or R 9 may be H, or 
a straight chain or branched alkyl group of length 1 to 10, which may incorporate from 1 
to 10 heteroatoms (N, O, P, S) within the chain. R 8 or R 9 may also be appended with 
up to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 or R 9 may 
15 also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, 
carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, 
phosphono). 

ft Reducing Agent . . ^R 8 R 9 



CTBF 



1 — 1/ ^ 

NR 8 R 9 



CTBF I— CH 2 



(d). Alternatively, the library may be assembled as ester BLFGs. 

20 (i). These CTBFs may be prepared by coupling carboxylic acid LFGs (X 

= OH), or derivatives of carboxylic acid LFGs (X = OR, SR, halide) with alcohols. 
Carboxylic acids and carboxylic acid derivatives may be coupled with alcohols 
employing numerous known methods to those practiced in the art [e.g., (March, et al., 
supra, pp. 392-398) and (e.g., Greene, et al., supra, pp. 227-228), and (Hughes et 

25 aI.(Paquette, Series Editor in Chief), Organic Reactions, John Wiley & Sons, New 
York. 1992, vol. 42, pp. 343-347)]. R« may be H, or a straight chain or branched alkyl 
group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) 
within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, 
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heteroaryl, carboxy ester, carboxamide, amino, /V-acylamino, alkoxy, hydroxy, 

mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is 

optionally substituted (alkyl, aryl, halide, heteroaryl, carboxy ester, carboxamide, 

amino, /V-acylamino, alkoxy, hydroxy, mercapto, phosphono). 

O O 

CTBF | — ^ + HO-R 8 I CTBF I — \ 



OR e 



(ii). These CTBFs may be prepared by reacting carboxy lie acid LFGs 
with an aryl, heteroaryl or alkyl group substituted with a leaving group X, where X may 
be a halide or a sulfonate group (OS0 2 R where R is substituted or unsubstituted alkyl or 
aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). The halide could be attached to 

10 aromatic or heteroaromatic functionality on R 8 , or it could be attached to an aliphatic 
group on R 8 . When X is substituted upon aromatic and heteroaromatic functionality, an 
SnAr reaction or a palladium-mediated, copper-mediated or related transtion metal 
mediated coupling reaction would be performed. Where X is substituted upon alkyl 
functionality, an Sn2 or SnI reaction would be performed. Methods for these 

15 transformations are known to those practiced in the art [e.g., (March, et al., supra, pp. 
398-399) and (e.g., Greene, et al., supra, pp. 228-229)]. R 8 may be H, or a straight 
chain or branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five R 9 
groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, /V-acylamino, 

20 alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl or 
heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, /V-acylamino, alkoxy, hydroxy, mercapto, phosphono). 

O O 

CTBF I — ^ + X-R 8 ^ I CTBF I — \ 



OH 



'OR 8 



25 (e). Alternatively the library may be assembled as thioester BLFGs. 

(i). These CTBFs may be prepared by condensation of carboxy lie acid 
LFGs (X = OH), or carboxylic acid derivatives (X = OR, SR, halide) and thiols. 
Carboxy lie acids and carboxylic acid derivatives may be coupled with thiols employing 
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known methods to those practiced in the art (e.g., March, et a)., supra, p. 409). R 8 
may be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be 
appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, /V-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 
R 8 may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
halide, heteroaryl, carboxy ester, carboxamide, amino, TV-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). 



5. Sulfonic acids as the linkage functional groups and the corresponding blocked 
linkage functional groups 

(a). The linkage functional group is the sulfonic acid. These CTBFs may be 
available commercially, or may be prepared by a variety of known methods to those 
practiced in the art. 

Sulfonic acid CTBFs are represented as follows. 



(b). Alternatively, the library may be assembled as sulfonamide BLFGs. These 
CTBFs may be prepared by reacting sulfonic acid LFGs (X = OH) or derivatives of 
sulfonic acids (X = halide, alkoxyl, mercaptyl) with amines (e.g., March, et al., supra, 
p. 499, Greene, et al. supra, pp. 379-385). R 8 and R 9 may be H, or a straight chain or 
branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms 
(N, O, P, S) within the chain. R 8 and R 9 may also be appended with up to five R 10 
groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, /V-acylamino, 
alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 and R 9 may also be an aryl or 
heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino /V-acylamino, alkoxy, hydroxy, mercapto, phosphono). 




CTBF — SQ 3 H 



CTBF 



+ HN— R 8 



CTBF 




N-R 8 
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6. The hydroxy! group as the linkage functional group and the corresponding blocked 
linkage functional groups 

(a). The linkage functional group is the hydroxy 1 group. 

These CTBFs may be available commercially, or may be prepared by a variety of 
known methods to those practiced in the art- 
Alcohol CTBFs are represented as follows. 



CTBF 



-OH 



(b). Alternatively the library may be assembled as ether BLFGs. 

(i). The ether may be prepared by reaction of an alcohol LFG with an 

10 aryl, heteroaryl or alkyl group substituted with a leaving group X, where X may be a 
halide or a sulfonate group (OSO^R where R is substituted or unsubstituted alkyl or aryl, 
e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). The halide could be attached to aromatic 
or heteroaromatic functionality on R 8 , or it could be attached to an aliphatic group on 
R 8 . When X is substituted upon aromatic and heteroaromatic functionality, an S^Ar 

15 reaction or a palladium-mediated, copper-mediated or related transtion metal mediated 
coupling reaction would be performed [(e.g., March, et al., supra, pp. 654-655) and 
Hartwig et al., supra, pp. 8005-8008). Where X is substituted upon alkyl functionality, 
an Sn2 or SnI reaction would be performed. R 8 may be H, or a straight chain or 
branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms 

20 (N, O, P, S) within the chain. R 8 may also be appended with up to five R 9 groups (R 9 
is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, /V-acylamino, alkoxy, 
hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group 
that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, 
/V-acylamino, alkoxy, hydroxy, mercapto, phosphono). 

OH + x— R 8 > 



CTBF 



CTBF H O-R 8 



(ii). The ether may be prepared by the Mitsunobu reaction between alcohol LFGs 
and another alcohol where the second alcohol is acidic (pKa < 12), for example, 
phenols and oximes (Hughes et al.(Paquette, Series Editor in Chief), Organic Reactions, 
John Wiley & Sons,- New York, 1992, vol. 42, pp. 335-636). R 8 may be H, or a 
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straight chain or branched alkyl group of length 1 to 10, which may incorporate from 1 
to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to 
five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl 
or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, /V-acylamino, alkoxy, hydroxy, mercapto, phosphono). The key 
requirement on R 8 is that the alcohol has a pKa < 12. 

>8 



CTBF 



— OH + HO — R 



CTBF — O-R 



(iii). The ether may also be prepared by the Mitsunobu reaction between 
10 alcohol LFGs and another alcohol, where the alcohol LFGs are acidic (pK a < 12), for 
example, phenols or oximes (Hughes et aI.(Paquette, Series Editor in Chief), Organic 
Reactions, John Wiley & Sons, New York, 1992, vol. 42, pp. 335-636). R 8 may be H, 
or a straight chain or branched alkyl group of length 1 to 10, which may incorporate 
from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with 
15 up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, 
A/-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). The key requirement 
on the pharmacophore substituted alcohol is that it has a pK a < 12. 

—OH + ho— R 8 



CTBF 



CTBF — O-R 8 



(c). Alternatively the library may be assembled as ester (Z = O) or thioester (Z 
20 = S) BLFGs. Alcohol LFGs may be coupled with carboxylic acids (Z = O, X = OH), 
carboxylic acid derivatives (Z = O, X = OR, SR, halide), or the corresponding thio- 
substituted derivatives (Z = S, X = OH, OR, SR halide). Carboxylic acids and 
carboxylic acid derivatives may be coupled with alcohols employing a variety of known 
methods to those practiced in the art [e.g., (March, et al., supra, pp. 392-398) and (e.g., 
25 Greene, et al., supra, pp. 227-228), and (Hughes et al.(Paquette, Series Editor in 
Chief), Organic Reactions, John Wiley & Sons, New York, 1992, vol. 42, pp. 343- 
347)]. R 8 may be H, or a straight chain or branched alkyl group of length 1 to 10, 
which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may 
also be appended with up to five R 9 groups (R 9 is halide, alkyl, aryl, heteroaryl, 
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carboxy ester, carboxamide, amino, /V-acylamino, alkoxy, hydroxy, mercapto, 
phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is optionally 
substituted (halide, alkyl, aryl, halide, heteroaryl, carboxy ester, carboxamide, amino, 
/V-acylamino, alkoxy, hydroxy, mercapto, phosphono). 

z 



CTBF 



JL . , — i > 



R 8 

8 



CTBF — O 



-OH + X R 
5 2 = O.S 

(d). Alternatively the library may be assembled from thioether BLFGs. The 
thioethers may be prepared by the Mitsunobu reaction between alcohol LFGs and a thiol 
(Hughes et al.(Paquette, Series Editor in Chief), Organic Reactions, John Wiley & Sons, 
New York, 1992, vol. 42, pp. 365-366, 381-382). R 8 may be H, or a straight chain or 

10 branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms 
(N, O, P, S) within the chain. R 8 may also be appended with up to Five R 9 groups (R 9 
is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, /V-acylamino, alkoxy, 
hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group 
that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, 

15 /V-acylamino, alkoxy, hydroxy, mercapto, phosphono). The thioether BLFGs may also 
be readily oxidized to more water soluble sulfoxide or sulfone BLFGs. 

>8 



CTBF 



-OH + HS— R° 



CTBF 



-S— R° 



(e). Alternatively the library may be assembled as carbamate (Z = O) or 
thiocarbamate (Z = S) BLFGs. 

20 (i). These CTBFs may be prepared by condensation of alcohol LFGs and 

isocyanates or isothiocyanates. The direct coupling of isocyanates and isothiocyanates 
with alcohols is straighforward and obvious to those practiced in the art (March, et al., 
supra, pp. 891-892). R 8 may be H, or a straight chain or branched alkyl group of 
length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the 

25 chain. R 8 may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, /V-acylamino, alkoxy, hydroxy, 
mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is 
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optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF —OH + 2~C = N — R 8 



CTBF 



8 



z 

H 

z = o, s z = o,s 

(ii). These CTBFs may be prepared by condensation of alcohol LFGs and 
5 carbamates, thiocarbamates, or related derivatives where X is an alkoxy group, a 
mercaptyl group, or a halide. R 8 may be H, or a straight chain or branched alkyl group 
of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within 
the chain. R 8 may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, jV-acylamino, alkoxy, hydroxy, 
10 mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). 

j 8 z 

CTBF f~ OH + X^N' R8 I CTBF k-A K1 .R 8 



R 9 

z = o,s z = o.s 

(iii). These CTBFs may be prepared in a two step process (e.g., March, 
15 et al., supra, p. 418; and Greene, et al., supra, pp. 315-348). In the first step, the 
alcohol LFGs are converted to carbonate (Z = O) or thiocarbonate (Z = S) or related 
derivatives where X and Y are alkoxy groups, mercaptyl groups, halides, or other 
suitable leaving groups. In the second step an amine is added to displace the leaving 
group Y. R 8 and R 9 may be H, or a straight chain or branched alkyl groups of length 1 
20 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. 
R 8 and R 9 may also be appended with up to five R 10 groups (R 10 is alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, /V-acylamino, alkoxy, hydroxy, 
mercapto, phosphono, sulphono). R 8 and R 9 may also be an aryl or heteroaryl group 
that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino 
25 /V-acylamino, alkoxy, hydroxy, mercapto, phosphono). 
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(1) 



CTBF —OH 



Z 

z = o, s 



CTBF 



(2) 



CTBF 



Z 

^cAy 
z = o,s 



NH 



CTBF 



Z 

A. 



R 9 



z = o,s 

(f)- Alternatively the library may be assembled with thioester BLFGs. The 
thioesters may be prepared by the Mitsunobu reaction between alcohol LFGs and a thiol 
acid (Hughes et al.(Paquette, Series Editor in Chief), Organic Reactions, John Wiley & 
Sons, New York, 1992, vol. 42, pp. 343-347). R 8 may be H, or a straight chain or 
branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms 
(N. O. P, S) within the chain. R 8 may also be appended with up to five R 9 groups (R 9 
is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, /V-acylamino, alkoxy, 
hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group 
that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, 
/V-acylamino, alkoxy, hydroxy, mercapto, phosphono). 

O 



CTBF 



—OH 



HS 



-A. 



CTBF 



(g). Alternatively the library may be assembled with amide, thioamide, urea, 
thiourea, sulfonamide, carbamate, thiocarbamate (Z = SO2, CO, CS, CO2, COS, CSO, 
CONR 11 , CSNR 11 .) BLFGs. Substitution of the hydroxyl LFGs are accomplished using 
the Mitsunobu reaction [ e.g., (Hughes et al.(Paquette, Series Editor in Chief), Organic 
Reactions, John Wiley & Sons, New York, 1992, vol. 42, pp. 335-636) and (Fukuyama, 
et al., Tetrahedron Letters, 38: 5831-5834 (1997) and references therein)]. An aliphatic 
hydroxyl group is prefered. R 8 and R 9 may be H, or a straight chain or branched alkyl 
group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) 
within the chain. R 8 and R 9 may also be appended with up to five R 10 groups (R 10 is 
alkyl, aryl, halide, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, 
alkoxy, hydroxy, mercapto, phosphono, sulphono). . R 8 and R 9 may also be an aryl or 
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heteroaryl group that is optionally substituted (alkyh aryl, halide, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF —OH 



+ N R 8 



CTBF — N 



Z-R« 



\ 



Z = S0 2 , CO, CS. C0 2 , COS, CSO, CONR 10 , CSNR 10 

(h). Alternatively, the library man be assembled with amine BLFGs. The amine 
BLFGs will be prepared from the BLFGs described in 6g. The acyl or sulfonyl 
functionality may be removed by methods known to those practiced in the art (such as, 
acidic or basic hydrolysis, or dissolving metal reactions for sulfonamides Greene et al., 
supra, 349-357 and 379-385). Milder conditions may be applied to more specialized 
groups, e.g., trifluoroacetamides may be cleaved by mild basic hydrolysis (Greene et 
al., supra , 353-354), and nitrosubstituted benzenesulfonamides may be cleaved by 
thiolate addition) to provide secondary amines (Fukuyama, et al., Tetrahedron Letters, 
38: 5831-5834 (1997) and references therein). Alternatively for some derivatives, 
reduction will provide tertiary amines (e.g., March, et al., supra, pp. 1212-1213). 



CTBF 



Z— R 8 



— N 



CTBF — N 



H 



15 



20 



or 



CTBF — N 



Z-R 8 



>11 



reduction 



\ 



CTBF 



Z = S0 2 , CO, CS, C0 2 , COS, CSO, CONR 10 , CSNR 10 

7. The thiol group as the linkage functional group and the corresponding blocked 
linkage functional groups 

(a). The linkage functional group is the thiol group. These CTBFs may be 
available commercially, or may be prepared by a variety of known methods to those 
practiced in the art. 

Thiol CTBFs are represented as follows. 



CTBM — SH 



(b). Alternatively the library may be assembled as thioether BLFGs. 
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(i). The thioether may be prepared by reaction of a thiol LFG with an 
aryl, heteroaryl or alkyl group substituted with a leaving group X, where X may be a 
halide or a sulfonate group (OS0 2 R where R is substituted or unsubstituted alkyl or 
aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). The halide could be attached to 
5 aromatic or heteroaromatic functionality on R 8 , or it could be attached to an aliphatic 
group on R 8 . When X is substituted upon aromatic and heteroaromatic functionality, an 
SftAr reaction or a palladium-mediated, copper-mediated or related transtion metal 
mediated coupling reaction would be performed. Where X is substituted upon alkyl 
functionality, an Sj^2 or Sjsjl reaction would be performed. These methods are known to 

10 those practiced in the art [(March et aL, supra, pp. 407-409) and (Peach in Patai The 
Chemistry of the Tfiiol Group, pt. 1, John Wiley & Sons, New York, 1974, pp. 721- 
735)]. R 8 may be H, or a straight chain or branched alkyl group of length 1 to 10, 
which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may 
also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, 

15 carboxamide, amino, jV-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 
R 8 may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, /V-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). The thioether BLFGs may also b oxidized to more water 
soluble sulfoxide and sulfone BLFGs (March, et al., supra, pp. 1201-1203). 

— SH + X — R 8 > 



20 



30 



CTBF 



CTBF — S-R 8 



(ii). The thioether may also be prepared by the Mitsunobu reaction 
between alcohol LFGs and a thiol (e.g., (Hughes et al. (Paquette, Series Editor in 
Chief), Organic Reactions, John Wiley & Sons, New York, 1992, vol. 42, pp. 335- 
636).. R 8 may be H, or a straight chain or branched alkyl group of length 1 to 10, which 
25 may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also 
be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, TV-acylamino, alkoxy, mercapto, phosphono, sulphono). The 
thioether BLFGs may also b oxidized to more water soluble sulfoxide and sulfone 
BLFGs (March, et al., supra, pp. 1201-1203). 

CTBF I — SH + HO— R 8 I CTBF 



— S— R 8 
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(c). Alternatively the library may be assembled with BLFGs whereby the thiol 
LFGs are acylated (March et al., supra, p. 409). Thiol LFGs may be coupled with 
carboxylic acids (Z = O, X = H), carboxylic acid derivatives (Z = O, X = OR, SR, 
halide), or the corresponding thio-substituted derivatives (Z = S, X = H, OR, SR 
5 halide). Carboxylic acids and carboxylic acid derivatives may be coupled with thiols 
employing known methods to those practiced in the art. R 8 may be H, or a straight 
chain or branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five R 9 
groups (R 9 is halide, alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
10 acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl 
or heteroaryl group that is optionally substituted (halide, alkyl, aryl, halide, heteroaryl, 
carboxy ester, carboxamide, amino, /V- acylamino, alkoxy, hydroxy, mercapto, 
phosphono). 



CTBF — S 



CTBF I — SH + X R 8 

Z = O.S 

15 (d). Alternatively the library may be assembled as thiocarbamate (Z = O) or 

thiocarbamate (Z = S) BLFGs. 

(i). These CTBFs may be prepared by condensation of thiol LFGs and 
isocyanates or isothiocyanates. The direct coupling of isocyanates and isothiocyanates 
with thiols is straighforward and obvious to those practiced in the art (Greene et al. 

20 supra, p. 301). R 8 may be H, or a straight chain or branched alkyl group of length 1 to 
10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, /V-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 may also be an aryl or heteroaryl group that is optionally substituted 

25 (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino /V-acylamino, alkoxy, 
hydroxy, mercapto, phosphono). 

z 



CTBF — SH + Z=C=N-R 8 



CTBF 



"S N 

H 

Z = O, S Z = O.S 
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(ii). These CTBFs may be prepared by condensation of thiol LFGs and 
carbamates, thiocarbamates, or related derivatives where X is an alkoxy group, a 
mercaptyl group, or a halide. R 8 may be H, or a straight chain or branched alkyl group 
of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within 
the chain. R 8 may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, /V-acylamino, alkoxy, hydroxy, 
mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF — SH 



X' 



CTBF 



2 

.A 



1ST 



z = o.s 



z = o.s 



(iii). These CTBFs may be prepared in a two step process. In the first 
step, the thiol LFGs are converted to carbonate (Z = O) or thiocarbonate (Z = S) or 
related derivatives where X and Y are alkoxy groups, mercaptyl groups, halides, or 
other suitable leaving groups (Greene et aL supra, pp. 299-301). In the second step an 
amine is added to displace the leaving group Y. R 8 and R 9 may be H, or a straight 
chain or branched alkyl groups of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 and R 9 may also be appended with up to 
five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, /V- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 and R 9 may also be 
an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino /V-acylamino, alkoxy, hydroxy, mercapto, phosphono). 

z 

A, 



(1) 



CTBF — SH 



Z 

2 = O, S 



CTBF 



(2) 



CTBF 



Z 

.A. 



Z = O.S 



NH 



CTBF 



I 

R 9 



Z = O.S 
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(e). Alternatively the library may be assembled as disulfide BLFGs. The 
disulfides may be prepared by reacting the thiol LFGs with thiols (X = H) or activated 
thiols (X = mercaptyl, halide, sulfonyl) employing known methods to those practiced in 
the art (e.g., Greene et al., supra, pp. 302-303). R 8 may be H. or a straight chain or 
branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms 
(N, O, P, S) within the chain. R 8 may also be appended with up to five R 9 groups (R 9 
is halide, alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, /V-acylamino, 
alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl or 
heteroaryl group that is optionally substituted (halide, alkyl, aryl, halide, heteroaryl, 
carboxy ester, carboxamide, amino, /V-acylamino, alkoxy, hydroxy, mercapto, 
phosphono). 



CTBF — SH + XS-R 8 



CTBF _s— S-R 8 



8. Vie isocyanate or isothiocyanate group as the linkage functional group and the 
corresponding blocked linkage functional groups 
15 (a). The linkage functional group is the isocyanate (Z = O) or isothiocyanate (Z 

= S) group. These CTBFs may be available commercially, or may be prepared by a 
variety of known methods to those practiced in the art. 

Isocyanate (Z = O) or isothiocyanate (Z = S) CTBFs are represented as follows. 

-N=C=Z 



CTBF 



z = o, s 

20 (b). Alternatively the library may be assembled with urea or thiourea BLFGs by 

reaction of the isocyanate (Z = O) or isothiocyanate (Z = S) LFGs with amines (March 
et al., supra, p. 903). The direct coupling of isocyanates with amines is straighforward 
and known to those practiced in the art. R 8 and R 9 may be H, or a straight chain or 
branched alkyl groups of length 1 to 10, which may incorporate from 1 to 10 

25 heteroatoms (N, O, P, S) within the chain. R 8 and R 9 may also be appended with up to 
five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 and R 9 may also be 
an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino /V-acylamino, alkoxy, hydroxy, mercapto, phosphono). 
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CTBF | — N=C=Z 

z = o, s 



HN-R 8 



CTBF 



R 



(c). Alternatively the library may be assembled as BLFGs by reaction of the 
isocyanate (Z = O) or isothiocyanate (Z = S) LFGs with thiols (Green et al., supra, p. 
301). R 8 may be H, or a straight chain or branched alkyl groups of length 1 to 10, 
which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may 
also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, /V-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 
R 8 may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino /V-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). 

.JL.R. 



CTBF h-N=C=Z 

Z = O, S 



HS-R 8 



CTBF 



H 



(d). Alternatively the library may be assembled as BLFGs by reaction of the 
isocyanate (Z = O) or isothiocyanate (Z = S) LFGs with alcohols (March et ah, supra, 
pp. 891-892). R 8 may be H, or a straight chain or branched alkyl groups of length 1 to 
10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 
may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, jV-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 may also be an aryl or heteroaryl group that is optionally substituted 
(alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino A-acylamino, alkoxy, 
hydroxy, mercapto, phosphono). 



CTBF 



-N=C=Z + HO-R 8 

z = o, s 



CTBF 



H 



9. The halide or related sulfonate group as the linkage functional group and the 
corresponding blocked linkage functional groups 

(a). The linkage functional group is the halide leaving group or a similarly 
reactive sulfonate (X= OS0 2 R where R is substituted or unsubstituted alkyl or aryl, 
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e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-NO : ) leaving group. These CTBFs may be 
available commercially, or may be prepared by a variety of known methods to those 
practiced in the art. 

The halide and sulfonate (X= OS0 2 R where R is substituted or unsubstituted 
alkyl or aryl, e.g. CH 3 , CF 3) phenyl-CH 3 and phenyl-N0 2 ) CTBFs may be represented 
as follows. 



CTBF 



(b). Alternatively the library may be assembled with amine BLFGs. 

(i). The amine BLFGs may be prepared by substitution of the leaving 

10 group (X = halide or OSO.R where R is substituted or unsubstituted alkyl or aryl, e.g. 
CH 3 . CF 3 , phenyl-CH 3 and phenyl-NO-,) with amines The leaving group may be attached 
to aromatic or heteroaromatic functionality, alkenyl functionality, or it could be attached 
to aliphatic functionality. A number of methods are known to those practiced in the art 
[e.g. March, et al. supra, pp. 656-657 and 411-413; Wagaw et al., J. Am, Chem. Soc. f 

15 119: 8451-8458 (1997) and references therein; and Ahman et al. Tetrahedron Letters, 
38: 6363-6366 (1997)]. When X is substituted upon aromatic and heteroaromatic 
functionality, an SNAr reaction may be performed. If the leaving group X is substituted 
upon aromatic, heteroaromatic or alkenyl functionality, a palladium-mediated (or related 
transition metal mediated) amine coupling reaction may be performed . Where X is 

20 substituted upon alkyl functionality, an Sn2 or S^l reaction may be performed. R 8 or 
R 9 may be H, or a straight chain or branched alkyl group of length 1 to 10, which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 or R 9 may also 
be appended with up to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 

25 R 8 or R 9 may also be an aryl or heteroaryl group that is optionally substituted (alkyl, 
aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). 



CTBF 



— X + NHR B R 9 >- CTBF — NR 8 R 9 



(ii). The amine BLFGs may also be prepared by a two step process. In the first 
30 step, substitution of the leaving group X with an amide, thioamide, urea, thiourea 
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sulfonamide, carbamate, thiocarbamate (Z = SCb, CO, CS, CCb, COS, CSO, CONR 11 , 
CSNR 11 ) is performed (see 0 - 0 below). X may be a halide or OSO.R where R is 
substituted or unsubstituted alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 . 
The leaving group may be attached to aromatic or heteroaromatic functionality, alkenyl 
5 functionality, or it may be attached to aliphatic functionality. . When X is substituted 
upon aromatic or heteroaromatic functionality, an SNAr reaction may be performed. 
When X is substituted upon aromatic, heteroaromatic, allylic or alkenyl a palladium- 
mediated (or related transtion metal mediated) amine coupling reaction may be 
performed. Where X is substituted upon alkyl functionality, an S^2 or SnI reaction 
10 may be performed. Methods are known to those practiced in the art (March et al., supra, 
pp. 425-427). In the second step, the acyl or sulfonyl functionality may be removed by 
methods known to those practiced in the an (such as, acidic or basic hydrolysis, or 
dissolving metal reactions for sulfonamides (Greene et a!., supra, 349-357 and 379- 
385). Milder conditions may be applied to more specialized groups, e.g., 

15 trifluoroacetamides may be cleaved by mild basic hydrolysis (Greene et al., supra, 353- 
354), and nitrosubstituted benzenesulfonamides may be cleaved by thiolate addition) to 
provide secondary amines (Fukuyama, et ah, Tetrahedron Letters, 38: 5831-5834 
(1997) and references therein). Alternatively for some derivatives, reduction will 
provide tertiary amines (e.g., March, et al., supra, pp. 1212-1213). R 8 to R 11 may be 

20 H, or a straight chain or branched alkyl group of length 1 to 10, which may incorporate 
from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 to R 11 may also be 
appended with up to five R 12 groups (R 12 is alkyl, aryl, halide, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
sulphono). R 8 to R 11 may also be an aryl or heteroaryl group that is optionally 

25 substituted (alkyl, aryl, halide, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). 
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(1) 



CTBF 



+ N R 



CTBF 



Z-R B 



R 9 



2 = S0 2 . CO. CS, C0 2 . COS. CSO. CONR 10 , CSNR 10 



(2) 



,2-R e 



CTBF — 



CTBF 



10 



15 



20 



CTBF N 



Z-R 8 



or 

reduction 



CTBF 



(c). Alternatively the library may be assembled with ether BLFGs. The ether 
BLFGs may be prepared by displacement of leaving group LFGs with an alcohol. X 
may be a halide or a sulfonate group (OS0 2 R where R is substituted or unsubstituted 
alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). The halide could be 
attached to aromatic or heteroaromatic functionality, alkenyl functionality, or it could be 
attached to an aliphatic functionality. When X is substituted upon aromatic or 
heteroaromatic functionality an S^Ar reaction may be performed. When X is substituted 
on aromatic, heteroaromatic, allylic, or alkenyl functionality a palladium-mediated, Cu 
mediated, or related trans tion metal mediated coupling reaction may be performed 
performed [(e.g., March, et al., supra, pp. 654-655) and Hartwig et al. Tetrahedron 
Lett., 38: pp. 8005-8008). Where X is substituted upon alkyl functionality, an S N 2 or 
SnI reaction may be performed (e.g., March, et al., supra, pp. 386-387 and 388-389). 
R 8 may be H, or a straight chain or branched alkyl group of length 1 to 10. which may 
incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be 
appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 
R 8 may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). 

—X + ho-R 8 > 



CTBF 



CTBF — O-R 8 



(d). Alternatively the library may be assembled with thioether BLFGs. The 
thioether BLFGs may. be prepared by displacement of leaving group LFGs with an thiol. 
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X may be a halide or a sulfonate group (OS0 2 R where R is substituted or unsubstiruted 
alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). The halide could be 
attached to aromatic or heteroaromatic functionality, alkenyl functionality, or it could be 
attached to an aliphatic functionality. When X is substituted upon aromatic or 
heteroaromatic functionality an S^Ay reaction may be performed. When X is substituted 
on aromatic, heteroaromatic, allylic, or alkenyl functionality a palladium-mediated. Cu 
mediated, or related transtion metal mediated coupling reaction may be performed. 
Where X is substituted upon alkyl functionality, an Sn2 or S^l reaction may be 
performed. These methods are known to those practiced in the art [(March et al., supra, 
407-409) and (Peach in Patai The Chemistry of the Thiol Group, pt. 1, John Wiley & 
Sons, New York, 1974, pp. 721-735)]. R$ may be H, or a straight chain or branched 
alkyl group of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, 
P, S) within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is alkyl, 
aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
15 mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). The thioether may also be oxidized 
to the more water soluble sulfoxide or sulfone BLFGs (March et al., supra, pp. 1201- 
1203). 



10 



20 



30 



CTBF 



— X + HS-R* ^ CTBF — S-R 8 



(e). Alternatively the library may be assembled with ester BLFGs. The ester 
BLFGs may be prepared by displacement of leaving group LFGs with a carboxylate. X 
may be a halide or a sulfonate group (OS0 2 R where R is substituted or unsubstituted 
alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). The halide could be 
25 attached to aromatic or heteroaromatic functionality, alkenyl functionality, or it could be 
attached to an aliphatic functionality. When X is substituted upon aromatic or 
heteroaromatic functionality an S^Ar reaction may be performed. When X is substituted 
on aromatic, heteroaromatic, allylic, or alkenyl functionality a palladium-mediated, Cu 
mediated, or related transtion metal mediated coupling reaction may be performed. 
Where X is substituted upon alkyl functionality, an S N 2 or S N 1 reaction may be 
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10 



performed. . Methods for these transformations are known to those practiced in the art 
[e.g., (March, et al., supra, pp. 398-399) and (e.g., Greene, et al., supra, pp. 228- 
229)]. R 8 may be H, or a straight chain or branched alkyl group of length 1 to 10, 
which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may 
also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 
R 8 may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). 

x , , y«> 

> R 8 *- CTBF — O 



CTBF —X + IVT -Cr"R 8 



(f). Alternatively the library may be assembled with thiolester BLFGs. The 
ester BLFGs may be prepared by displacement of leaving group LFGs with a thiolacid. 
X may be a halide or a sulfonate group (OS0 2 R where R is substituted or unsubstituted 
alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). The halide could be 

15 attached to aromatic or heteroaromatic functionality, alkenyl functionality, or it could be 
attached to an aliphatic functionality. When X is substituted upon aromatic or 
heteroaromatic functionality an S N Ar reaction may be performed. When X is substituted 
on aromatic, heteroaromatic, allylic, or alkenyl functionality a palladium-mediated, Cu 
mediated, or related transtion metal mediated coupling reaction may be performed. 

20 Where X is substituted upon alkyl functionality, an Sn2 or Sjsjl reaction may be 
performed. R 8 may be H, or a straight chain or branched alkyl group of length 1 to 10, 
which may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may 
also be appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 

25 R 8 may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). 



CTBFj-X + SH^R 8 - | CTBF H 

-68- 



SUBSTITUTE SHEET (RULE 26) 

BNSDOCtD: <WO 9949314A1_IA> 



WO 99/49314 



PCT/US99/06734 



10 



15 



20 



(g). Alternatively the library may be assembled with urea or thiourea BLFGs. 
These BLFGs may be prepared by displacement of leaving group LFGs with a primary 
or secondary ureas (Z = O) or thioureas (Z = S). X may be a halide or a sulfonate 
group (OS0 2 R where R is substituted or unsubstituted alkyl or aryl, e.g. CH 3 , CF 3 , 
phenyl-CF 3 and phenyl-N0 2 ). The halide could be attached to aromatic or 
heteroaromatic functionality, alkenyl functionality, or it could be attached to an aliphatic 
functionality. When X is substituted upon aromatic or heteroaromatic functionality an 
S^Ar reaction may be performed. When X is substituted on aromatic, heteroaromatic, 
allylic or alkenyl functionality a palladium-mediated, Cu mediated, or related transtion 
metal mediated coupling reaction may be performed. Where X is substituted upon alkyl 
functionality, an Sn2 or S^l reaction may be performed. R 8 to R 10 may be H, or a 
straight chain or branched alkyl group of length 1 to 10, which may incorporate from 1 
to 10 heteroatoms (N, O, P, S) within the chain. . R 8 to R 10 may also be appended with 
up to five R 11 groups (R 11 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). . R 8 to R 10 
may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono). 



(h). Alternatively the library may be assembled with sulfonamide BLFGs. The 
sulfonamide BLFGs may be prepared by displacement of leaving group LFGs with a 
sulfonamide. X may be a halide or a sulfonate group (OS0 2 R where R is substituted or 
unsubstituted alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). The halide 
could be attached to aromatic or heteroaromatic functionality, alkenyl functionality, or it 
could be attached to an aliphatic functionality. When X is substituted upon aromatic or 
heteroaromatic functionality an SNAr reaction may be performed. When X is substituted 
on aromatic, heteroaromatic, allylic or alkenyl functionality a palladium-mediated, Cu 
mediated, or related transtion metal mediated coupling reaction may be performed. 
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Where X is substituted upon alkyl functionality, an Sn2 or SnI reaction may be 
performed.. These methods are known to those practiced in the art [(March et al., 
supra, pp. 425-427) and (Fukuyama et al., Tetrahedron Letters, 38:5831-5834 (1997)]. 
R 8 may be a straight chain or branched alkyl group of length 1 to 10, which may 
5 incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be 
appended with up to five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). 
R 8 may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
10 mercapto, phosphono). 



O O nn 



CTBF I — X + "N X R 



8 



R 9 



CTBF 



/ 
■N 

X o9 



(i). Alternatively the library may be assembled with carbamate, thiocarbamate, 
or related BLFGs. These BLFGs may be prepared by displacement of leaving group 
with the corresponding carbamate, thiocarbamate, or related BLFGs. X may be a halide 

15 or a sulfonate group (OS0 2 R where R is substituted or unsubstituted alkyl or aryl, e.g. 
CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). The halide could be attached to aromatic or 
heteroaromatic functionality, alkenyl functionality, or it could be attached to an aliphatic 
functionality. When X is substituted upon aromatic or heteroaromatic functionality an 
SwAr reaction may be performed. When X is substituted on aromatic, heteroaromatic, 

20 allylic or alkenyl functionality a palladium-mediated, Cu mediated, or related transtion 
metal mediated coupling reaction may be performed. Where X is substituted upon alkyl 
functionality, an Sn2 or SnI reaction may be performed. R 8 to R 9 may be H, or a 
straight chain or branched alkyl group of length 1 to 10, which may incorporate from 1 
to 10 heteroatoms (N, O, P, S) within the chain. . R 8 to R 9 may also be appended with 

25 up to five R 11 groups (R*° is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, 
amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). . R 8 to R 10 
may also be an aryl or heteroaryl group that is optionally substituted (alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono. 
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CTBF | — X + N YR 



R 



9 



CTBF 

— r V 

Z = O, S; Y = O.S 



(j). Alternatively the library may be assembled by BLFGs where the leaving 
group is replaced with carbon-based functionality. 

(i). X may be a halide or a sulfonate group (OSO : R where R is 
5 substituted or unsubstituted alkyl or aryl, e.g. CH 3? CF 3 , pheny!-CH 3 and phenyl-N0 2 ). 
The metal (M) may be any metal, but the prefered metals are BL n , SnL n , ZnL n , ZrL n , 
CuL n , SiL n , Ti L n , Al L n , and L n where L corresponds to metal ligands (e.g., halide, 
alkoxide, alkyl, aryl, alkenyl, heteroaryl, phosphine sulfide, amido) many of which are 
acceptable and are known to those practiced in the art. The halide could be attached to 

10 aromatic or heteroaromatic functionality, alkenyl functionality, or it could be attached to 
an aliphatic functionality. Addition can occur directly, or can be catalyzed by transition 
metals. Numerous methods are known to those practiced in the art [e.g. (Hegedus, L. 
S. (1994) in Transition Metals in the Synthesis of Complex Organic Molecules pp. 65- 
129, University Science Books, Mill Valley), (Knochel et al. Chemical Reviews 

15 93:2117-2188 (1993)), (Miyaura et al.. Chemical Reviews 95: 2457-2483 (1995)), and 
(Farina et al.(Paquette, Series Editor in Chief), Organic Reactions, John Wiley & Sons, 
New York, 1997, vol. 50, pp. 1-653)]. R 8 may be H, or a straight chain or branched 
alkyl group of length 1 to 10. which may incorporate from 1 to 10 heteroatoms (N, O, 
P, S) within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is alkyl, 

20 aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono, sulphono). R 8 may also be an aryl or heteroaryl group that is 
optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). 



CTBF 



M-R 8 ^ I CTBF | — R 8 



25 (ii). X may be a halide or a sulfonate group (OS0 2 R where R is 

substituted or unsubstituted alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). 
The BLFGs may be prepared in a two step process. In the first step, an organometallic 
reagent is prepared where the metal (M) may be any metal, but the prefered metals are 
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BL n , SnL n , ZnL n , ZrL n , CuL n , SiL n , Ti L n , Al L n , and L n where L corresponds to metal 
ligands (e.g., halide, alkoxide, alkyl, aryl, alkenyl, heteroaryl, phosphme sulfide, 
amido) many of which are acceptable and are known to those practiced in the art. In the 
second step, carbon-carbon bond formation is performed to generate the BLFGs. 
5 Numerous methods for this two step process are known to those practiced in the art [e.g. 
(Hegedus, L. S. (1994) in Transition Metals in the Synthesis of Complex Organic 
Molecules pp. 65-129, University Science Books, Mill Valley), (Knochel et al. 
Chemical Reviews 93:2117-2188 (1993)), (Miyaura et al.. Chemical Reviews 95: 2457- 
2483 (1995)), and (Farina et al.(Paquette, Series Editor in Chief), Organic Reactions, 

10 John Wiley & Sons, New York, 1997, vol. 50, pp. 1-653)]. R 8 may be H, or a straight 
chain or branched alkyl group of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to five R 9 
groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, 
alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be an aryl or 

15 heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 

(1) CTBF I — X ^ I CTBF I— M 



(2) I CTBF I M + R 8 X [ CTBF | — R 8 

(iii). X may be a halide or a sulfonate group (OSO : R where R is 
substituted or unsubstituted alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). 

20 The halide would preferably be attached to aromatic, heteroaromatic functionality, or 
alkenyl functionality. A palladium-mediated Heck reaction or related transtion metal 
mediated coupling reaction would be performed [e.g., (de Meijere, A. et al., Angew, 
Chem. Int. Ed, Engl., 33:2379-2411 (1994) and references therein) and Heck, 
Palladium Reagents in Organic Synthesis, Academic Press, London 1985)]. For the 

25 alkene, R 8 - R 9 may be straight chain or branched alkyl groups of length 1 to 10, which 
may incorporate from 1 to 10 heteroatoms (N, O, P, S) within the chain. R 8 - R 9 may 
also be appended with up to five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy 
ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono, 
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sulphono). R 8 - R 9 may also be aryl or heteroaryl groups that are optionally substituted 
(alkyl, aryl, heteroaryl, carboxy ester, carboxamide. amino, N-acylamino, alkoxy, 
hydroxy, mercapto, phosphono). 




(iv). X may be a halide or a sulfonate group (OSO : R where R is 
substituted or unsubstituted alkyl or aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). 
The halide would preferably be attached to aromatic, heteroaromatic functionality, or 
alkenyl functionality. A copper-mediated, palladium-mediated reaction, or related 

15 transtion metal mediated coupling reaction would be performed. Numerous methods are 
known to those practiced in the art [e.g., (March et al., supra, p. 481), (Sonagashira, 
K. in Comprehensive Organic Synthesis; Trost, B. M., Fleming, I., Eds.; Pergamon 
Press: New York, 1991, vol. 3 pp. 521-549), and (Rossi et al., Org. Prep. Proced. Int. 
27:129-160 (1995)]. For the alkyne, R 8 may be H, straight chain or branched alkyl 

20 groups of length 1 to 10, which may incorporate from 1 to 10 heteroatoms (N, O, P, S) 
within the chain. R 8 may also be appended with up to five R 9 groups (R 9 is alkyl, aryl, 
heteroaryl, carboxy ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, 
mercapto, phosphono, sulphono). R 8 may also be a aryl or heteroaryl group that is 
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10 



15 



20 



25 



optionally substituted (alkyL aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono). 

v + = d8 . 



CTBF 



CTBF 



10. The alkenyl group as the linkage functional group and the corresponding blocked 
linkage functional groups 

(a). The linkage functional group is the alkenyl group. These CTBFs may be 
available commercially, or may be prepared by a variety of known methods to those 
practiced in the art. 

CTBFs may be represented as follows, where as many as four CTBF fragments 
(A through D) may be present on the alkene functionality . 



CTBF- 



Part A 



X yjCTBF-— 

/ \| CTRF 



CTBF- 



Part B 



Part C 



CTBF- 



Part D 



(b). Alternatively the library may be assembled with modified alkene BLFGs. 

(i). The alkene BLFGs may be prepared by performing a transition metal- 
mediated (typically Pd) catalyzed Heck reaction or related reactions on trisubstituted, 
disubstituted or the vinyl functionalized CTBFs [e.g., (de Meijere, A. et al., Angew. 
Chem. Int. Ed. EngL, 33:2379-2411 (1994) and references therein) and Heck, 
Palladium Reagents in Organic Synthesis, Academic Press, London 1985)]. R 8 may be a 
straight chain or branched alkyl group of length 1 to 10, which may incorporate from 1 
to 10 heteroatoms (N, O, P, S) within the chain. R 8 may also be appended with up to 
five R 9 groups (R 9 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 may also be a aryl or 
heteroaryl group that is optionally substituted (alkyl, aryl, heteroaryl, carboxy ester, 
carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). X may be 
a halide or a sulfonate group (OS0 2 R where R is substituted or unsubstituted alkyl or 
aryl, e.g. CH 3 , CF 3 , phenyl-CH 3 and phenyl-N0 2 ). 

| CTBF Part A [ } CTBF Part C 



CTBF Part A j I 

CTBF Part B \ h 



+ x— R 8 



I CTBF 
| CTBF 



Part A ~"l R 8 

— Pan b r ^y 



CTBF Part C 
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(ii). The alkene BLFGs may be prepared by performing a transition metal 
(typically Ru or Mo) catalyzed olefin metathesis reaction [e.g., (Grubbs et al. Accounts 
of Chemical Research 28:446-452 (1995)) and (Schuster et al. Angewandte Chemie- 
International Edition in English 36:2037-2056 (1997))]. R 8 - R 14 may be straight chain 
5 or branched alkyl groups of length 1 to 10, which may incorporate from 1 to 10 
heteroatoms (N, O, P, S) within the chain. R 8 - R 14 may also be appended with up to 
five R 10 groups (R 10 is alkyl, aryl, heteroaryl, carboxy ester, carboxamide, amino, N- 
acylamino, alkoxy, hydroxy, mercapto, phosphono, sulphono). R 8 - R 14 may also be aryl 
or heteroaryl groups that are optionally substituted (alkyl, aryl, heteroaryl, carboxy 
10 ester, carboxamide, amino, N-acylamino, alkoxy, hydroxy, mercapto, phosphono). 
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77. Methods to Covalently Bond TBFs using the LFG with a Bifunctional Linker (BFL) to 
Produce Candidate Crosslinked Target Binding Molecules (CXL-TBFs). 
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15 Upon identification of the TBFs, crosslinking is accomplished with a BFL. All 

of the chemistry described for preparation of the BLFGs could be employed in the 
crosslinking step. One method would be to crosslink TBFs that have the same LFGs 
and employing the same crosslinking chemistry. A second method would be to crosslink 
TBFs that have the same LFGs, but employing different crosslinking chemistry. A third 

20 method would be to crosslink TBFs that have different LFGs which is most but not all 
cases would require different corsslinking chemistry. Each of these strategies use 
known methods to those practiced in the art. Examples of the three methods are 
provided below. 

Method 1. For example, two TBFs with aldehyde LFGs could be crosslinked 
25 employing a BFL that incorporates two O-substituted hydroxy lamines. 



Method 2. For example, two TBFs with aldehyde LFGs could be crosslinked 
employing a BFL that incorporates one O-substituted hydroxylamine and one acyl 
hydrazide. 
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Method 3. For example, one TBF with an aldehyde LFG and one TBF with an 
amine LFG could be crosslinked employing a BFL that incorporates one O-substituted 
hydroxylamine and one carboxylic acid. 

A hypothetical example for the preparation of heterolinkers would be to employ 
amino acids as heterolinkers. As shown below, the amino acid heterolinker serves to 
link TBFs with carboxylic acid LFGs to TBFs with amine LFGs. The amine LFG could 
be a primary amine or a secondary amine (not shown). Many methods that are known to 
10 those practiced in the art could be used to prepare the CXL-TBFs using amino acid 
heterolinkers. One of the methods is described below. 



0 HO 



For this sample method, the amino acid BFL is protected as the N-tert- 
butoxycarbonyl (Boc) derivative. Many W-Boc amino acids are commercially available, 
15 e.g., Novabiochem (San Diego, CA) and Neosystems (Strosbourg, France). Af-Boc 
protected amino acids can also be prepared by known methods to those practiced in the 
art (Bodansky et al., The Practice of Peptide Synthesis, Springer- Verlag, Berlin, 1984, 
18-20). 

The sample method is illustrated in Scheme 1 . Sample experimental procedures 
20 are provided below. 

Step 1 (Desai, et al., Tetrahedron Letters, 34, 7685-7688, (1993)) 

To a suspension of polymer-bound carbodimide (1.5 mmol) in chloroform (10 
mL) is added the N-Boc protected amino acid 2 (0.55 mmol) and the amine-substiruted 
TBF 1 (0.50 mmol). After the reaction mixture is shaken overnight at room 
25 temperature, the mixture is filtered. The resin is washed with chloroform (3 x 7.5 mL) 
and the combined filtrate is evaporated in vacuo to yield 3. 

Step 2 
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To compound 3 (0.5 mmol) is added 5 mL of a solution of 4.0 M hydrochloric acid in 
dioxane (Aldrich, Milwaukee, WI). The solution is stirred for one hour at room 
temperature and then evaporated to remove the solvent and excess hydrochloric acid. 
The product is diluted with 5 mL of methanol and concentration is repeated to provide 4. 
5 Step 3 (DesaL et aL, Tetrahedron Letters, 34, 7685-7688, (1993)) 

To a suspension of polymer-bound carbodiimide (1.5 mmol) in chloroform (10 
mL)is added intermediate 4 (0.50 mmol) and the carboxylic acid-substituted TBF 5 
(0.55 mmol). After the reaction mixture is shaken overnight at room temperature the 
mixture is filtered. The resin is washed with chloroform (3 x 7.5 mL) and the combined 
10 filtrate is evaporated in vacuo to yield product 6, which is the desired CXL-TBF. 

Scheme 1 

Step 1 : 

| TBF n | NH 2 + BocHN | Linker " ° " ^ BocHN ^7^7] -^ ""Llg^J 

0 O 
1 2 3 



Step 2: 
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Step 3: 
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EXPERIMENTAL 

Unless otherwise noted, materials were obtained from commercial suppliers and 
used without further purifications. Aldehydes were purchased from Aldrich Chemical 
Company, Inc. (Milwaukee, WI). Anhydrous dimethylsulfoxide (DMSO) and acetic 
5 acid were purchased from Fischer (Pittsburg, PA). Soluble CD4 (sCD4) was purchased 
from Intracel Corporation (Issaquah, WA), gpl20 and anti-gpl20 antibody were 
purchased from DuPont (Wilmington, DE) and ophenylenediamine peroxidase substrate 
tablet sets were purchased from Sigma Chemical Co, (St. Louis, MO). Reactions were 
carried out in commercially available Beckman 2 ml deep-well microtiter plates. 
10 EXAMPLE I 

Pharmacophore Recombination for the Identification of Compounds Capable 

of 

Inhibiting the Interaction Between gp!20 and CD4 

To demonstrate the principle of pharmacophore recombination, we established a 
15 biochemical screen for the inhibition of gpl20-CD4 binding. This assay measures the 
ability of small molecules to inhibit the binding of gpl20 to sCD4 that is immobilized on 
a microtiter plate. Binding of sCD4 was quantified with an anti-gpl20 antibody 
conjugated to horseradish peroxidase. 

General procedure for the synthesis of an oxime compound library 

20 For several reasons, we chose to initially employ O-methyl oximes, rather than 

aldehydes, for the initial compound building block library. First, O-methyl oximes best 
model the pharmacophore units in the final oxime coupled dimers. Second, O-methyl 
oximes are more soluble in aqueous solution than their more hydrophobic aldehyde 
precursors. Also, the oxime functionality is clearly not inherently toxic and does not 

25 interfere with good pharmacokinetics or cell permeability since oximes are present in 
many drugs. Finally, the O-methyl oximes are easily prepared in a single step 
condensation of aldehydes with O-methyl hydroxylamine, without requiring purification 
of the resultant product. The chemical condensation of an aldehyde with O-methyl 
hydroxylamine to provide an oxime compound is shown in Figure 1. 

30 In the first step of the method, the initial oxime library was synthesized by 

separately condensing O-methyl hydroxylamine with 252 different aldehydes in a DMSO 
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solution. The oxime library was prepared in a spatially separate fashion in a microliter 
plate format such that each well contained a single oxime compound. More specifically, 
in each well of a microtiter plate, a DMSO solution of an unique aldehyde (0.188 ml, 
0.15 M, 0.028 mmol) was added. To this solution, a DMSO solution of 0-methyl 
5 hydroxylamine (0.083 ml, 0.5 M, 0.042 mmol) was then added followed by addition of 
a DMSO solution of acetic acid (0.023 ml, 0.5 M, 0.011 mmol). The plates were 
allowed to sit at room temperature overnight during which time condensation occurred, 
thereby providing the 252 member library of oxime compounds. 

Assay to determine which oxime compounds are capable of inhibiting the interaction 

10 between gpl20 and CD4 

The 252 member oxime compound library prepared as described above was then 
screened for the presence of compounds capable of inhibiting the interaction between 
gpl20 and CD4 in a standard ELISA assay. For the gpl20-CD4 ELISA assay, an 
Immulon-2 microtiter plate was incubated overnight at 4 °C with 70 ng of sCD4 in 100 

15 \x\ of carbonate buffer. The solution was removed from the plate and washed three times 
with phosphate buffered saline (PBS) at pH 7.4. The plate was blocked with 150 \i\ 
PBS-Tween-BSA (0.5% BSA, 0.05% Tween-20) for 1 h at room temperature and then 
washed again. gpl20 (1 ng) in 50 \x\ of PBS and 50 \i\ of test organic oxime compound 
(3 mM), 40 jal PBS, 10 \x\ were added and incubated for 1 h at room temperature. The 

20 plate was then washed and 100 of anti-gpl20 conjugated horseradish peroxidase was 
added and incubated for 1 h at room temperature. The bound gp!20 was then 
quantitated with <?-phenylenediamine as a substrate. 

The results of these assays demonstrated that 30 of the 252 oxime compounds 
were capable of inhibiting the interaction between gpl20 and CD4, wherein the 

25 approximate EC 50 values ranged from about 20 jaM to 500 jaM. The structurally related 
aldehyde analogs of 30 of these oximes showed diverse structural motifs including 
chromones, phenols and furans (see Figure 2). 

Cross-linking of the top 30 structurally related aldehyde analogs to produce a library 
of candidate compounds and screening of those candidate compounds for the ability to 
30 inhibit the interaction between gpl20 and CD4 
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Each of the 30 structurally related aldehydes analogs corresponding to the 30 
oxime compounds identified above as being capable of inhibiting the interaction between 
gpl20 and CD4 were individually coupled to each of the other 29 aldehydes with a 
variety of linkers to produce a library of candidate compounds for binding to the target 
5 molecule. Each of the 30 individual aldehydes was linked with another aldehyde 
through an 0,0'-diamino-alkanediol linker to obtain the library of cross-linked candidate 
compounds. Each aldehyde combination was kept spatially separate, but an equimolar 
mixture of five different O,0'-diamino-alkanediol linkers were used in each coupling 
reaction to provide a 450-member library of cross-linked candidate compounds. The 

10 chemistry used for preparation of the compounds is shown in Figure 3 showing the 
synthesis of both homodimers and heterodimers. 

The five linkers employed each consisted of two hydroxylamine groups tethered 
to an aliphatic chain having either two, three, four, five or six methylene units. This 
allowed us to evaluate any distance dependency the two pharmacophores may have in the 

15 binding site. Linkers were prepared as follows. To a round-bottomed flask was added 
alkyl dibromide (20.2 mmol), /V-hydroxyphthalimide (36.8 mmol, "1.8 equiv) and 
dimethylformamide (90 ml). The flask was cooled to 0 °C and 1,8- 
diazabicyclo[5.4.0]undec-7-ene (40.5 mmol) was added dropwise with stirring. The 
reaction was allowed to warm to room temperature and was then stirred overnight. The 

20 reaction mixture was then poured into 1M HC1 (500 ml). The resulting white solid 
precipitate was washed with water (3 x 50 ml) and methanol (3 x 50 ml) and sent onto 
the next step without further purification. 

13.9 mmol of the crude bis-TV-alkoxyphthalimide was then added to a round- 
bottomed flask in combination with dimethoxyethylene glycol (150 ml). To another 

25 flask was added hydrazine monohydrate (41.8 mmol, 3 equiv.) and dimethoxyethylene 
glycol (100 ml). The suspension of bis-Af-alkoxyphthalimide was added slowly with 
stirring to the hydrazine solution. The flask was refluxed for 3 h, allowed to cool to 
room temperature and the resulting precipitate was filtered away. The remaining 
supernatant solution was concentrated and the resulting yellowish-oil was purified by 

30 Kugelrohr distillation (0.01 mm Hg, 60-70 °C) and column chromatography (89:9:2 
CHCl 3 /MeOH/NH 4 OH). 
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For 0,0*-diamino-l ,4-butanedioI), the general synthesis procedure described 
above was followed. IR (film from CH 2 C1 2 ): 3412.8,3310.0,2942.7,2866.3 cm 1 . l U 
NMR (400 MHz, CDC1 3 ): 5 5.31 (br s, 4H), 3.65 (m, 4H), 1.57 (m, 4H). 13 C NMR 
(400 MHz, CDC1 3 ): 5 75.5, 24.8. Anal. Calcd for C 4 H l2 0 2 N 2 : C, 39.99; H, 10.07; N, 
5 23.32. Found: C, 40.17; H, 9.90; N, 23. 12. 

Once the five linkers were obtained, the 450 linked aldehyde combinations were 
prepared as follows. In each well, a DMSO solution of each of two different aldehydes 
(0.045 ml, 0.15 M, 0.007 mmol each) was added. To this solution, a DMSO solution of 
an equimolar mixture of the five linkers (0.025 ml, 0.3 M of each linker, 0.007 mmol) 
10 was added followed by a DMSO solution of acetic acid (0.005 ml, 0.5 M, 0.003 mmol). 
The plates were allowed to sit at room temperature overnight to allow for potential 
ligand formation. 

Each of the 450 members of the potential ligand library was then tested at a 
concentration of 100 yiM each (i.e., a concentration that is 10-fold more dilute than the 

15 concentration employed in the initial oxime monomer screen) for the ability to inhibit the 
interaction between gpl20 and CD4 using the ELISA assay described above. The 
results from these assays demonstrated that more than 300 of the 450 members of the 
potential ligand library showed greater than 50% inhibition activity at the concentration 
employed. When the 450 members of the potential ligand library were tested at a 

20 concentration of 1 ±iM each for the ability to inhibit the interaction between gpl20 and 
CD4, 17 of the ligands showed greater than 50% inhibitory activity at that 
concentration. The chemical structures of 12 of the 17 most active aldehyde precursors 
are shown in Figure 4. 

Evaluation of linker length dependence on activity 

25 The 17 linked aldehyde combinations with the greatest activity were then 

resynthesized as described above with a unique linker per well (85 separate wells) so as 
to evaluate any linker-length dependency on binding. Screening each of the 85 ligand 
compounds at 1 (iM each showed that ligands that incorporated pharmacophore 1 (see 
Figure 5) were more potent than the other ligands and that there indeed was a 

30 dependence upon linker length. Specifically, ligands with linkers having either 4 or 5 
methylene units were much more active than ligands with linkers having 6 methylene 
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units, whereas ligands with linkers having either 2 or 3 methylene units were only 
slightly more active than ligands with linkers having either 4 or 5 methylene units. 
Biological and Analytical Characterization of Representative Ligands 

One of the ligands that exhibited strong activity for inhibiting the interaction 
5 between gpl20 and CD4 (as shown in Figure 6) was resynthesized with each of the five 
different linkers on a large scale and was then purified by column chromatography. 
Column chromatography purification enabled isolation of the heterodimer from the 
homodimer. 

Large scale synthesis of organic oxime compounds was performed as follows. 

10 To a flame-dried round-bottomed flask was added aldehyde (0.82 mmol) and DMSO (8 
ml). A 0.9 M 0-methyl hydroxylamine (1.4 ml) was then added and the reaction 
mixture was allowed to stir at room temperature overnight. The reaction was poured 
into methylene chloride (50 ml), washed with H 2 0 (3 x 20 ml), dried and concentrated. 
Silica gel chromatography provided the pure organic oxime compounds. 

15 The oxime compounds made by this method were characterized as follows. 

(1) O-methvl oxime of 6-nitropiperonal 

Reaction of 6-nitropiperonal with (9-methyl hydroxylamine provided 
predominantly one oxime isomer which was purified by silica gel chromatography 
(10:90, EtOAc/hexanes). 'H NMR (400 MHz, CDC1 3 ): 5 8.61 (s, 1H), 7.53 (s, 1H), 
20 7.37 (s, 1H), 6.15 (s, 2H), 4.00 (s, 3H). Anal. Calcd for C 9 H 8 O s N 2 : C, 48.22; H, 3.60; 
N, 12.50. Found: C, 48:40; H, 3.75; N, 12.56. 

(2) O-methvl oxime of 6,8-dichloro-3-formvlchromone 

Reaction of 6,8-dichloro-3-formylchromone provided 1:1 cis/trans isomers which 
were isolated by silica gel chromatography (20:80, CH 2 Cl 2 /hexanes). 
25 Isomer 1: ] H NMR (400 MHz, CDC1 3 ): 5 8.48 (s, 1H), 8.24 (s, 1H), 8.12 (d, 1H, J = 
2.5), 7.75 (d, 1H, J = 2.5), 3.97 (s, 3H). Anal. Calcd for C n H 7 0 3 NCl 2 : C, 48.56; H, 
2.59; N, 5.15. Found: C, 48.44; H, 2.47; N, 5.03. 

Isomer 2: *H NMR (400 MHz, CDC1 3 ): 5 9.45 (s, 1H), 8.12 (d, 1H, J = 2.5), 7.75 (d, 
2H, J = 2.5), 4.07 (3H). Anal. Calcd for C U H 7 0 3 NC1 2 : C, 48.56; H, 2.59; N, 5.15. 
30 Found: C, 48.45; H,.2.49; N, 5.11. 
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Large scale synthesis of oxime ligands was performed as follows. To a flame- 
dried round-bottom flask was added 10 ml of DMSO and 1.03 mmol of each of the two 
aldehydes to be incorporated into the ligand. After all solids were dissolved, a solution 
of the appropriate linker (1.24 mmol) in 1 ml of DMSO was added dropwise, followed 
5 by the addition of acetic acid (0.72 mmol). The reaction mixture was allowed to stir at 
room temperature overnight. The reaction was then poured into methylene chloride (50 
ml), washed with H 2 0 (3 x 20 ml), dried and concentrated. Silica gel chromatography 
provided the isolated homo/heterodimers. Cis/irans isomers, when present, were not 
separated and were purified as mixtures of isomers. The oxime dimers made by this 
10 method were characterized below. 

(1) Oxime heterodimer of 6,8-dichloro-3-formyIchromone and 6-nitropiperonah 
linker containing 4 methylene units 

The heterodimer was separated from the homodimers by silica gel 
chromatography (20:80, EtOAc/hexanes). The heterodimer was isolated and 
15 characterized as a 1:1 mixture of cis/trans isomers. Anal. Calcd for C^H^OgNjC^: C, 
50.59; H, 3.28; N, 8.05. Found: C, 50.70; H, 3.40; N, 7.89. 

Isomer 1: 'H NMR (400 MHz, CDCI 3 ): 5 9.42 (s, 1H), 8.55 (s, 1H), 8.00 (s, 1H), 
7.67 (m, 2H), 7.39 (s, 1H), 7.27 (s, 1H), 6.09 (s, 2H), 4.22 (m, 4H), 1.84 (m, 4H). 

Isomer 2: l H NMR (400 MHz, CDC1 3 ): 8 8.54 (s, 1H), 8.41 (s, 1H), 8.16 (s, 1H), 
20 7.99 (s, 1H), 7.67 (s, 1H), 7.40 (s, 1H), 7.24 (s, 1H), 6.08 (s, 2H), 4.22 (m, 4H), 1.83 
(m, 4H). 

(2) Oxime heterodimer of 6 1 8-dichloro-3-formylchromone and 6-nitropiperonal, 
linker containing 5 methylene units 

The heterodimer was separated from the homodimers by silica gel 
25 chromatography (20:80, EtOAc/hexanes). The heterodimer was isolated and 
characterized as a 1.5:1 mixture of cis/trans isomers. Anal. Calcd for C 23 H 19 0 8 N 3 C1 2 : 
C, 51.51; H, 3.57; N, 7.83. Found: C, 51 .68; H, 3.70; N, 7.66. 

Isomer 1: *H NMR (400 MHz, CDC1 3 ): 5 8.62 (s, 1H), 8.47 (s, 1H), 8.24 (s, 1H), 
8.10 (s, 1H), 7.74 (s, 1H), 7.73 (s, 1H), 7.37 (s, 1H), 6.14 (s, 2H), 4.20 (m, 4H), 1.8 
30 (m, 4H), 1.53 (m, 2H). 
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Isomer 2: l H NMR (400 MHz, CDC1 3 ): 5 9.47 (s, 1H), 8.61 (s. 1H), 8.11 (s, 1H), 
7.75 (s, 2H), 7.48 (s, 1H), 7.33 (s, 1H), 6.14 (s, 2H), 4.30 (t, 2H, J = 6.6), 4.20 (m, 
2H), 1.80 (m, 4H), 1.53 (m, 2H). 

These purified heterodimers and homodimers were then tested as described above 
5 for the ability to inhibit the interaction between gpl20 and CD4. The results of these 
assays demonstrated that heterodimers shown in Figure 6 having a linker containing 
from 2 to 5 methylene units exhibited EC 50 's ranging from 0.6 to 1.5 \xM and showing 
10- to 20-fold enhancement in inhibitory activity over the compound shown in Figure 5 
(EC 50 in the range of about 10-15 |iM). The other compound that was incorporated into 

10 the heterodimer had an EC 50 of greater than 50 ^M. 

The heterodimers shown in Figure 6 having linkers containing from 2 to 5 
methylene units are of comparable potency to the most potent compounds that have been 
identified to date that block the CD4/gpl20 interaction (Tanaka et al., 7. Antibiotics 
50:58 (1997), Sun et al., J. Antibiotics 49:689 (1997), Jarvest et al., Bio. Med. Chem. 

15 Lett. 3:2851 (1993) and Chen et al., Proc. NatL Acad. Sci. USA 89:5872 (1992)). In 
addition, these ligand heterodimers are considerably less complex than previously 
identified compounds with comparable activity. Further optimization of the optimal 
building block and linker combinations could presumably be accomplished by evaluating 
a larger range of linkers with enhanced rigidity or by incorporating analogs of the 

20 optimal aldehyde precursors. 

EXAMPLE 2 

Pharmacophore Recombination Using N,N-Dimethvlamines and Other Diamine Linkers 

In addition to the use of aldehydes and oximes for the pharmacophore 
recombination method as described above, additional chemistries also find use. In this 

25 example, the organic compound building blocks are N,N-dimethylamine compounds that 
are prepared by reductive amination of starting aldehydes and dimethylamine using 
support-bound triacetoxyborohydride (Kaldor et al.. Tetrahedron Lett. 37:7193-7196 
(1996)). The chemistry of these reactions is shown in Figure 7. Removal of the 
support-bound reducing agent by filtration followed by concentration to remove the 

30 volatile, excess dimethylamine then provides the pure N,N-dimethylamine monomer 
building blocks. Alternatively, the N,N-dimethylamine building blocks may be obtained 
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by reduction using a sodium borohydride-based reducing agent in solution. The 
resulting amine product is then isolated from the excess reducing agent or aldehyde by 
passing down an acidic ion exchange column. The amine product is then obtained by 
elution from the ion exchange column with a volatile amine such as ammonia followed 
5 by concentration. 

Linkage of the N,N-dimethylamine building blocks can be accomplished through 
the use of diamine linkers of which many are commercially available and many more can 
be readily prepared using well known methodology. The commercial availability of the 
diamine linkers allows rapid optimization of linker length, rigidity and orientation. An 

10 exemplary synthesis sequence is shown in Figure 8. Specifically, support-bound 
chloride (3) (or other support-bound halide) is treated with excess of a diamine to 
provide an amine-derivatized support (4). Acylation of the amine functionality then 
provides support-bound formamide or carbamate (5). Reduction then provides support- 
bound secondary amine (6). Reductive amination then introduces one of the 

15 pharmacophore elements (7). Acid treatment then releases a secondary amine from the 
support, which can then be treated with the second pharmacophore monomer and sodium 
triacetoxyborohydride to provide the desired pharmacophore heterodimer (either 
support-bound reagent or alternative scavenging methods may be employed). Initial 
attachment of the diamine to the support can be accomplished using other support-bound 

20 alkyl halides or could be accomplished by reductive amination of a support-bound 
aldehyde or ketone. Fewer linkers are available that contain two secondary amine 
groups, but these can also be incorporated, in this case, the acylation step (4 to 5 in 
Figure 8) and the subsequent reduction step (5 to 6 in Figure 8) would be eliminated. 

The foregoing description details specific methods which can be employed to 

25 practice the present invention. Having detailed such specific methods, those skilled in 
the art will well enough know how to devise alternative reliable methods at arriving at 
the same information in using the fruits of the present invention. Thus, however, 
detailed the foregoing may appear in text, it should not be construed as limiting the 
overall scope thereof; rather, the ambit of the present invention is to be determined only 

30 by the lawful construction of the appended claims. All documents cited herein are 
expressly incorporated by reference. 
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WE CLAIM: 

1. A method for identifying a drug lead compound that binds to a target 
biological molecule (TBM) of interest, the method comprising: 

(a) assembling a library of candidate target binding fragments (CTBF) 
capable of being chemically cross-linked by a cross-linker to provide candidate cross- 
linked target binding fragments for binding to the target biological molecule; 

(b) screening the library of candidate target binding fragments to identify at 
least first and second candidate target binding fragments that bind to the target biological 
molecule; 

(c) chemically cross-linking the at least first and second candidate target 
binding fragments or structurally related analogs thereof with a cross-linker to provide a 
library of candidate cross-linked target binding fragments for binding to the target 
biological molecule; and 

(d) screening the library obtained in (c) to identify a drug lead compound that 
binds to the target biological molecule. 

2. The method according to Claim 1, wherein at least one of the candidate 
target binding fragments of the library of candidate target binding fragments binds to the 
target biological molecule with a K d of from about 5 mM to about 0.05 mM. 

3. The method according to Claim 1, wherein at least one of the candidate 
target binding fragments of the library of candidate target binding fragments binds to the 
target biological molecule with a K d of from about 3 mM to about 0. 1 mM. 

4. The method according to Claim 1, wherein the drug lead compound 
identified in step (d) binds to the target biological molecule with a K d of 500 nM or 
lower. 

5. The method according to Claim 1, wherein the screening steps (b) and (c) 
consist essentially of an in vitro biological assay. 
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6. The method according to Claim 1, wherein the library of candidate cross- 
linked target binding fragments for binding to the target biological molecule comprises 
homodimeric or heterodimeric candidate cross-linked target binding fragments. 

7. The method according to Claim 1, wherein the library of candidate target 
binding fragments comprises candidate target binding fragments of less than about 500 
daltons. 

8. The method according to Claim 1 wherein the library of candidate cross- 
linked target binding fragments comprises candidate cross-linked target binding 
fragments of less than about 750 daltons. 

9. The method according to Claim 1, wherein the target biological molecule 
is a human or human pathogen protein. 

10. The method according to Claim 9, wherein the protein is an enzyme, a 
human hormone, a human receptor and fragments thereof having nitrogen's in the 
protein present in their naturally occurring isotopic abundance. 

11. The method according to Claim 1, wherein at least one of the screening 
steps (b) and (d) is accomplished by ELISA assay. 

12. A method for inhibiting the binding of a first biological molecule to a 
second biological molecule that binds to the first biological molecule, the method 
comprising: 

contacting a system comprising both the first and second biological molecules 
with a binding inhibitory amount of a drug lead compound identified according to the 
method of Claim 1, wherein the drug lead compound binds to the first biological 
molecule and inhibits its ability to bind to the second biological molecule. 

13. The method according to Claim 12, wherein the first and second 
biological molecules are human proteins. 
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14. The method according to Claim 13, wherein the first and second 
biological molecules are selected from the group; a human hormone, cytokine or 
chemokine, a human receptor and fragments thereof 

15. A method for identifying a drug lead compound that binds to a target 
biological molecule of interest, the method comprising: 

(a) assenbling a library of candidate target binding fragments, each fragment 
containing an oxime linking group; 

(b) screening the library of candidate target binding fragments or monomers 
to identify at least first and second oxime containing candidate target binding fragments 
that bind to the target biological molecule; 

(c) chemically crosslinking the aldehyde analogs of the at least first and 
second oxime containing candidate target binding fragments with an 0,0'-diamino- 
alkanediol cross-linker to provide a library of oxime containing candidate cross-linked 
target binding fragments for binding to the target biological molecule; and 

(d) screening the library obtained in (c) to identify a drug lead compound that 
binds to the target biological molecule. 

16. A method comprising: 

(a) assembling a library of candidate target binding fragments (CTBF), each 
fragment having a linkable functional group (LFG) or blocked form thereof (BLFG), the 
blocked form containing a linking group (LG); 

(b) contacting the candidate target binding fragments with a target biological 
molecule (TBM); 

(c) measuring a change in a first physical association (PA-1) of the target 
biological molecule; 

(d) selecting target binding fragments (TBF) based on (c); 

(e) reacting selected target binding fragments having a linkable functional 
group with a cross-linker, having chemically compatible cross-reactive groups (CFG) 
with the LFG, under conditions suitable for forming a library of candidate cross-linked 
target binding fragments (CXL-TBF); 
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(0 contacting the candidate cross-linked target binding fragments with the 
target biological molecule (TBM); 

(g) measuring a change in a second physical association (PA-2) of the target 
biological molecule; 

(h) selecting cross-linked target binding fragments (XL-TBF) based on (g). 

17. The method of Claim 16 wherein the candidate target binding fragment 
contacted with the TBM contains a blocked linkage functional group (BLFG) containing 
linking group LG. 

18. The method of Claim 17 where in the linking group (LG) in BLFG is 
selected from the group; oxime, hydrazone, N-acyl hydrazone, secondary amine, tertiary 
amine, acetal, ketal, 1.2 amino alcohols, amide, N,N-disubstituted amides, thioamide, 
ureido. thioureido, carbamate, thiocarbamate, thiothiocarbamate, sulfonamide, 
carbonate, guanidino, amidino, thioester, ester, ether, 2-hydroxyether, 2- 
hydroxythioether, thioether, disulfide, alkane (alkylene), alkene (alkenylene) and alkyne 
(alkynylene). 

19. The method of Claim 18 wherein each candidate target binding fragment 
(CTBF) of step (b) contains the same linking group (LG) as is present in the candidate 
cross-linked target binding fragment (CXL-TBF) of step (f). 

20. The method of Claim 19 wherein the linking group (LG) is selected from 
the group; oxime, secondary amine, tertiary amine, amide, ureido, thioureido, 
sulfonamide and carbamate. 

21. The method of Claim 19 wherein two candidate target binding fragments 
(CTBF) selected from step (d) are cross-linked to form a candidate cross-linked target 
binding fragment (CXL-TBF) represented by 




where 
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TBF m is a first TBF selected from step (d) which contained LG, in its blocked 
linking group ; 

TBF n is a second TBF selected from step (d) which contained LG 2 in its blocked 
linking group; 

XL represents the cross-linker without the chemically compatible cross-reactive 
functional groups; 

LG] represents the linking group in the first TBF; and 

LG 2 represents the linking group in the second TBF. 

22. The method of Claim 21 wherein LG, and LG 2 are the same or are 
different and are selected from the group oxime, secondary amine, tertiary amine, 
amide, ureido, thioureido, sulfonamide and carbamate. 

23. The method of Claim 21 wherein TBF m and TBF n are the same or are 
different. 

23. The method of Claim 21 where XL is selected from the group of alkanes: 
methylene, ethylene, propylene, butylene, pentylene, hexylene and heptylene, optionally 
containing 0, 1, 2 or 3 ether linkages and from 1-3 double bonds and aryls: ortho-, 
meta- or para- C 0 -C 6 -alkyl-phenyl-C 0 -C 6 -alkylene . 

24. The method of Claim 16 wherein the candidate cross-linked target binding 
fragments are represented by the formula: 



TBF m LGj- XL |j 



y 2 



TBF n 



where 

TBF m represents a first TBF selected from step (d); 
TBF n represents a second TBF selected from step (d); 

XL represents the cross-linker without the chemically compatible cross-reactive 
functional groups selected from the group 
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C 0 -C (0 -alkylene, 

C 0 -C 6 -alkyl-C 6 -C 10 -aryI-C 0 -C 6 -alkylene, 
C r C 6 -alkyl-N(R,)-C r C 6 -alkylene, 

(C r C 6 -alkyl-0-C r C 6 -alkylene) n , where n=l, 2, 3 or 4; 
LG] and LG 2 are linking groups independently selected from the group 
-C(R a ) = N-0», -0-N=C(R a )-, -CH 2 -N(R>, -N(R a )-CH r , 
-C( = 0)-N(R,)- f -N(R 3 )-C(=0)-, -N(R>C( = 0)-0- f -0-C( = 0)-N(BO-, 
-N(R 3 )-C( = 0)-N(R b )-, -N(R 3 )-C( = 0)-N(R b )-, -S0 2 -N(R a )- and 
-N(R a )-S0 2 -; 

R3 and R b are independently selected from the group 

hydrogen, C r C I0 -aIkyl, C 0 -C I0 -alkyl-Q-C 10 -aryI, C 6 -C 10 -aryl-C 0 -C I0 -alkyl, C 0 - 
C 10 -alkyl-heterocycle-C 0 -C I0 -alkyl, C r C 6 -alkyI-NH-C r C 6 -alkyl, C 0 -C I0 -alkyl-O-C 0 -C l0 - 
alkyl, C 0 -C I0 alkyI-C( = O)-C 0 -C 10 -alkyI, C 0 -C 10 -alkyl-NH-C( = O)-C 0 -C t0 -alkyU C 0 -C ]0 - 
alkyl-O-C( = O)-C 0 -C 10 -alkyl, where any alkyl, aryl or heterocycle is optionally 
substituted with C r C I0 -alkyl, C r C 10 -alkoxy, C 6 -C 10 -aryl, C 6 -C I0 -aryloxy, halo (F, CI, 
Br, I), hydroxy, carboxy, amino, nitro and S(0)o. 3 . 

25. The method of Claim 24 wherein the TBF m and TBF n from step (d) each 
independently bind to the target biological molecule with a K d of from about 3 mM to 
about 100 nM. 

26. The method of Claim 25 wherein the TBF m and TBF n from step (d) each 
independently bind to the target biological molecule with a K d of from about 2 mM to 
about 500 |iM. 

27. The method according to Claim 24, wherein the target biological molecule 
is a human or human pathogen protein. 

28. The method according to Claim 24, wherein the protein is an enzyme, a 
human hormone or a human receptor having nitrogen's in their naturally occuring 
isotopic abundance. 
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29. The method according to Claim 24, wherein at least one of steps (b) and 
(d) is accomplished by ELISA assay. 

30. The method of Claim 21 wherein 



TBF m 


— LGt- 


XL 


LGy— 
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TBF„ 



m— XL r~K 



TBF n 



TBF,, 



N^N I 1 \ 

I l f 

Ri R 2 \ 



TBF n 



TBF„ 



and 



J^_| XL [-^ s\ 



TBF n 



31. The method of Claim 16 wherein the candidate cross-linked target binding 
fragments are represented by the formulae: 



TBF n 



— LGt- 


XL 


L&j— 


TBF n 



and 



TBF m -part A 



TBF n -part C 



XL 



TBF -part B 



TBF n -part D 



where 

TBF m represents a first TBF selected from step (d); 
TBF n represents a second TBF selected from step (d); 

TBF m-part A and B represent TBF m from step (d) where each fragment is bonded 
to a single atom in LG 3 ; 

TBF n -part C and D represent TBF n from step (d) where each fragment is bonded 
to a single atom in LG 4 ; 

XL represents a cross-linker of the formula 
-(C 0 -C 2 -alkyl-L , -L 2 -L 3 -L 4 -L 5 -C 0 -C 2 -aIkyl)-; 

LG, and LG 2 are linking groups independently selected from the group 
-C(RJ = N-CK -0-N = C(R>, -CH.-NCRJ-, -N(RJ-CH 2 -, -C( = 0)-N(R>, -N<R,)- 
C( = 0)- f -N(R a )-C( = 0)-0- J -0-C( = 0)-N(R3)-, -N(R,).C( = 0)-N(R b )-, -N(R>C( = O)- 
N(R b K -S0 2 -N(R3)- and -N(R>S0 2 -; 

LG 3 and LG 4 are linking groups independently selected from the group 
>C = N-CK -0-N=C<, -CH 2 -N < , >N-CH 2 -, -C( = 0)-N<, >N-C(=OX >N- 
C(=0)-0-, -0-C( = 0)-N<, >N-C( = 0)-N(R b )-, -N(RJ~C( = 0)-N < , -S0 2 -N< and 
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>N-SO r , where < and > represent two bonds linking TBF-part A, B, C, or D to the 
single N or C atom in LG 3 or LG 4 ; 

and R b are independently selected from the group 
hydrogen, C r C I0 -alkyl, C 0 -C t0 -alkyl-C 6 -C 1(r aryl, C 6 -C l0 -aryl-C 0 -C I0 -alkyK C 0 -C 1(r alkyl- 
heterocycle-C 0 -C l0 -alkyl, C r C 6 -alkyl«NH-C r C 6 -alkyl, C 0 -C 10 -alkyl-O-C 0 -C (0 -alkyl, C 0 - 
C I0 alkyl-C( = O)-C 0 -C 10 -aIkyI t C 0 -C I0 -alkyl-NH-C( = O)-C 0 -C 10 -alkyl, C 0 -C 10 -alkyl-O- 
C( = O)-C 0 -C J0 -alkyl, where any alkyl, aryl or heterocycle is optionally substituted with 
C r C I0 -alkyl, C r C 10 -alkoxy, C 6 -C I0 -aryl, C 6 -C 10 -aryloxy, halo (F, CI, Br, I), hydroxy, 
carboxy, amino, nitro and S(OV 3 ; 

TBF m , TBF n , TBF m -part A, TBF^part B, TBF n -part C and TBF n -part D are each 
independently represented by formula I 



Cycle 1 and Cycle 2 are independently present or absent and are selected from a 
mono-, bi-, or tricyclic saturated, unsaturated, or aromatic ring, each ring having 5, 6 
or 7 atoms in the ring where the ring atoms are carbon or from 1-4 heteroatoms selected 
from; nitrogen, oxygen, and sulfur, and where any sulfnr ring atom may optionally be 

oxidized and any carbon ring atom may form a double bond with O, NR n and CR 1 ^', 

each ring nitrogen may be substituted with R n and any ring carbon may be substituted 



-A-(Cycle l)-B-(Cycle 2)-E 



(D 



Where 



with R d ; 



A and B are independently selected from 





and 





where: 



-94- 



BNSDOCID: <WO 994931 4A1JA> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/4931 4 PCT/US99/06734 

L 1 is absent or may be selected from oxo (O), S(0) s , C( = 0), C( = N-R n ), 
C( = CR 1 R r ), C^R 1 '), C(R l ), C, het, N(R n ) or N; 

L is absent or may be selected from oxo (O), S(0) s > C( = 0), C( = N-R n ), 
C( = CR 2 R 2 '), C(R 2 R 2 '), C(R 2 ), C, het, N(R n ) or N; 

L is absent or may be selected from oxo (O), S(0) s , C( = 0), C( = N-R n ), 
C( = CR 3 R 3 \ C(R 3 R 3 ') C(R 3 ), C, het, N(R n ) or N; 

4 n 

L is absent or may be selected from oxo (O), S(0) g , C( = 0), C( = N-R ), 
C( = CR 4 R 4 '), C(R 4 R 4 '), C(R 4 ), C, NR n or N; and 

L 5 is absent or may be selected from oxo (O), S(0) s , C( = 0), C( = N-R n ), 

C(R 5 R 5 '), C( = CR 5 R 5 \ C(R 5 ), C, NR n or N; 

1 1 ' 2 2' 3 3' 4 4' 5 5' 
R,R , R , R ,R,R ,R,R ,R and R each are independently selected 

from R a , R a \ R c and U-Q-V-W; where s is 0-2 

Optionally, each R*-R 5 or NR n together with any other R ! -R 5 or NR n may form 
a mono-, bi-, or tricyclic saturated, unsaturated, or aromatic ring, each ring being a 
homo- or heterocycle having 5, 6 or 7 atoms in the ring, optionally each ring containing 
1-4 heteroatoms selected from N, O and S where any ring carbon or sulfur atom may 

optionally be oxidized, each ring nitrogen optionally substituted with R n and each ring 

carbon optionally substituted with R d ; 

E is -L X -L 2 -L 3 -R a ; 

R a is selected from the group; hydrogen, halo(F, CI, Br, I), halo(F, CI, Br, I)- 
Cj-Cjjalkyl, halo(F, CI, Br, D-Cj-Cjjalkoxy, hydroxy-Cj-C^lkyl, cyano, 

isocyanate, carboxy-Cj-Cj jalkyl, amino, Cq-Cj jalkyl-amino-CCj-Cgalkyl), C Q - 

Cj jalkyl-amino-dHCj-Cgalkyl), aminocarbonyl, C i" C i jalkylcarbonylamino, 

carboxamido, carbamoyl, carbamoyloxy, formyl; formyloxy, azido, nitro,. hydrazide, 
hydroxamic acid, imidazoyl, ureido, thioureido, thiocyanato, hydroxy, Cj-C^lkoxy, 
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mercapto, sulfonamide het, phenoxy, phenyl, benzyl, benzyloxy, benzamido, tosyl, 
morpholino, morpholinyl, piperazinyl, piperidinyl, pyrrolinyl. imidazolyl and indolyl; 

R is selected from the group of C 0 -C 1() a]ky]-Q-C 0 -C 6 alkyl, C Q -C 10 alkenyl-Q- 

C Q -C 6 alkyl, C 0 -C 10 alkynyl-Q-C 0 -C 6 alkyl, C 3 -C n cycloalky]-Q-C 0 -C 6 alkyl, 

C 10 cycloalkenyl-Q-C 0 -C 6 alkyl, Cj-Cgalkyl-C^C^aryl-Q-C^alkyl, Cg-C^aryl- 

C r C 6 alkyl-Q-C 0 -C 6 alkyl, CQ-Cgalkyl-het-Q-C^Cgalkyl, C 0 -C 6 aIkyl-Q-het-C 0 - 

C 6 alkyl, het-C 0 -C 6 alkyl-Q-C 0 -C 6 alkyl, C^C^lkyl-Q-Cg-C^aryl and Q-Cj-Cgalky, 

where any aryl or het is optionally substituted with 1-3 R d and any alkyl, alkenyl or 

alkynyl is optionally substituted with 1-3 R a ; 
a a ' 

R andR may join to form a 3-7 member homocyclic ring substituted with 1-3 

R a ; 

R c is selected from hydrogen and substituted or unsubstituted; amino, O-Cj- 
C g alkyl, amino-CCj-Cgalkyl), amino-dHCj-Cgalkyl), Cj-C^alkyl, C 2 -C 1() alkenyl, 
C 2 -C 1() alkynyl, C 3 -C n cycloalkyl, C 3 -C 1() cycloalkenyl, Cj-C^lkyl-C^-C^aryl, C g - 
C 10 aryI_C r C 6 alky1 ' C !- c 6 a lkyl-het, het-Cj-Cgalkyl, Cg-C^aryl and het, where the 

substituits on any alkyl, alkenyl or alkynyl are 1-3 R a and the substituents on any aryl or 
het are 1-3 R d ; 

R d is selected from R h and R p ; 

R is selected from the group OH, OCF 3 , OR c , SR m , halo(F, CI. Br, I), CN, 

isocyanate, N0 2 , CF 3> C 0 -C 6 alkyl-NR n R n ', C 0 -C 6 alkyl-C( = O)-NR n R n ', C 0 -C 6 alkyl- 

C(=0)-R a , Cj-Cgalkyl, C^Cgalkoxy, C 2 -C g alkenyl, C 2 -C g alkynyl, C 3 -C 6 cycloalkyl, 
C 3 -C 6 cycloalkenyl, Cj -Chalky 1-phenyl, phenyl-Cj -Chalky 1, Cj-Cgalkyloxycarbonyl, 
phenyl-C 0 -C 6 alkyloxy, Cj-Cgalkyl-het, het-Cj -Chalky 1, S0 2 -het, O-Cg-C^aryl, S0 2 - 
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C^-C^aryl, SO ? -Cj-Cgalkyl and het, where any alkyl, alkenyl or alkynyl may 

optionally be sustituted with 1-3 groups selected from OH. halo(F, CI, Br, I), nitro, 
amino and aminocarbonyl, where the substituents on any aryl or het are 1-2 hydroxy, 
halo(F, CI, Br, I), CF 3 , Cj-Cgalkyl, Cj-Cgalkoxy, nitro and amino; 

R m is selected from hydrogen, S-Oj -Chalky!, C( = 0)-C 1 -C 6 alkyl, C( = 0)- 
NR n R n ', Cj-CgalkyI, halo(F, CI, Br, D-Cj-Cgalkyi, benzyl and phenyl; 

R n is selected from the group R c , OH, OCF 3 , OR°, CN, isocyanate, NH- 

C( = 0)-0-R c , NH-C( = 0)-R c , NH-C( = 0)-NHR c , NH-S0 2 -R s , NH-S0 ? -NH-C( = 0)- 

R c , NH-C( = 0)-NH-S0 2 -R s , C( = 0)-0-R°, C( = 0)-R c , C( = 0)-NHR c , C(=0)-NH- 

C( = 0)-0-R°, C( = 0)-NH-C( = 0)-R c , C( = 0)-NH-S0 2 -R s , C( = 0)-NH-S0 2 -NHR c , 

S0 2 -R s , S0 2 -0-R°, S0 2 -N(R c ) 2 , S0 2 -NH-C( = 0)-0-R°, S0 2 -NH-C( = 0)-0-R° and 

S0 2 -NH-C( = 0)-R c ; 

R° is selected from hydrogen and substituted or unsubstituted C j -Chalky 1, C Q - 
C 6 alkyl-C 6 -C 1() aryl, Cj-Cgalkylcarbonyl, C 2 -C 6 alkenyl, C 2 -C 6 alkynyl, C 3 ~ 

CgCycloalkyl and benzoyl, where the substituits on any alkyl are 1-3 R a and the 

substituents on any aryl are 1-3 HP; 

R p is selected from the group; OH, halo(F, CI. Br, I), CN, isocyanate, OR°, 
SR m , SOR°, N0 2 , CF 3 , R c , NR n R n ' , N(R n )-C( = 0)-0-R°, N(R n )-C(=0)-R c , C Q - 

C 6 alkyl-S0 2 -R s , C 0 -C 6 alkyl-SO 2 -NR n R n ', C( = 0)-R c , 0-C( = 0)-R c , C(=0)-0-R° 

and C(=0)-NR R , where the substituits on any alkyl, alkenyl or alkynyl are 1-3 R a 
and the substituents on any aryl or het are 1-3 R d ; 
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R s is a substituted or unsubstituted group selected from; C 1 -C 0 alkyl, C n - 

1 o 2 

C g alkenyl, C^-Cgalkynyl, C 3 -C g cycloalkyl, C^C^cycloalkenyl, C^Cgalkyl-phenyl, 
phenyl-C 0 -C 6 a!kyl, C Q -C 6 alkyl-het and het-C Q -C 6 alkyl, where the substituits on any 

alkyl, alkenyl or alkynyl are 1-3 R a and the substituents on any aryl or het are 1-3 R d ; 

het is any mono-, bi-, or tricyclic saturated, unsaturated, or aromatic ring where 
at least one ring is a 5-, 6- or 7-membered ring containing from one to four heteroatoms 
selected from the group nitrogen, oxygen, and sulfur, the 5-membered ring having from 
0 to 2 double bonds and the 6- or 7-membered ring having from 0 to 3 double bonds and 
where any carbon or sulfur atoms in the ring may optionally be oxidized, and where any 
nitrogen heteroatom may optionally be quaternized and where any ring may contain from 

0-3 R d ; 

U is an optionally substituted bivalent radical selected from the group; Cj- 
C 6 alkyl, C Q -C 6 alkyl-Q, C 2 -C 6 alkenyl-Q, and C 2 -C 6 alkynyl-Q, where the substituits on 

any alkyl, alkenyl or alkynyl are 1-3 R a ; 

Q is absent or is selected from the group; -O-, -S(O) -, -SO -N(R n )-, -N(R n )-, - 

s £ 

N(R n )-C(=0)-, -N(R n )-C( = 0)-0-, -N(R n )-S0 2 -, -C(=0)-, -C(=0)-0-, -het-,-C( = 0)- 

N(R n )-, -PO(OR c )0- and -P(0)0-, where s is 0-2 and the heterocyclic ringis substituted 
with 0-3 R h ; 

V is absent or is an optionally substituted bivalent group selected from Cj- 
C 6 alkyl, C 3 -C g cycloalkyl, C 0 -C 6 alkyl-C 6 -C 1() aryl, and Cg-C^alky-het, where the 

substituits on any alkyl are 1-3 R a and the substituents on any aryl or het are 1-3 R d ; 

W is selected from the group; hydrogen, -OR°, -SR m , -NR n R n \ -NH-C(=0)- 
0-R°, -NH-C( = 0)-NR n R n ', -NH-C(=0)-R c , -NH-S0 2 -R s , -NH-S0 2 -NR n R n ', -NH- 

S0 2 -NH-C(=0)-R c , -NH-C( = 0)-NH-S0 2 -R s , . -C(=0)-NH-C(=0)-0-R°, -C(=0)- 
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NH-C( = 0)-R c , -C( = 0)-NH-C( = 0)-NR n R n \ -C( = 0)-NH-S0 2 -R s , -C( = 0)-NH-S0 2 ~ 
NR n R n , -C( = S)-NR n R n ', -SC> 2 -R S , -SCyO-R 0 , -SC> 2 -NR n R n \ -S0 2 -NH-C( = 0)-0- 
R°, -S0 2 -NH-C( = 0)-NR n R n \ -S0 2 -NH-C( = 0)-R c , -0-C( = 0)-NR n R n ', -0-C( = 0)- 

R c , -0-C( = 0)-NH-C( = 0)-R c , -0-C( = 0)-NH-S0 2 -R s and -0-S0 2 -R s ; 

Optionally, TBF m -part A together with TBF m -part B and TBF n -part C together 
with TBF n -part D may independently form (Cycle 1) substituted with -B-(Cycle 2)-E.. 

32. A compound made by the method of Claim 30. 
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